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OPTICAL DISC, VIDEO DATA EDITING APPARATUS, 
COMPUTER- READABLE RECORDING MEDIUM STORING AN EDITING 
PROGRAM, REPRODUCTION APPARATUS FOR THE OPTICAL DISC, 
AND COMPUTER-READABLE RECORDING MEDIUM STORING AN 
REPRODUCTION PROGRAM 



FTF.T.n QE THE IN VENTION 

The present invention relates to an optical disc, 
a video data editing apparatus, a computer-readable 
recording medium that stores an editing program, a 
reproduction apparatus for the optical disc, and a 
computer-readable recording medium that stores a 
reproduction program. 

B&gggEQIZ Mfi THE INVENTION 

Video editors in the film and broadcasting 
industries make full use of their skill and experience 
when editing the great variety of video productions that 
reach the market. While movie fans and home video 
makers may not possess such skill or experience, many 
are still inspired by professional editing to try video 
editing for themselves. This creates a demand for a 
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domestic video editing apparatus that can perform 
advanced video editing while still being easy to use. 

While video editing generally involves a variety 
of operations, domestic video editing apparatuses that 
5 are likely to appear on the market in the near future 
will especially require an advanced scene linking 
function. Such function links a number of scenes to 
form a single work. 

When linking scenes using conventional domestic 

10 equipment, the user connects two video cassette 

recorders to form a dubbing system. The operations 
performed when linking scenes using this kind of dubbing 
system are described below. 

Fig. 1A shows a video editing setup using video 

15 cassette recorders that are respectively capable of 

recording and playing back video signals. The setup of 
Fig. 1A includes the video cassette 301 that records the 
source video, the video cassette 302 for recording the 
editing result, and two video cassette recorders 303 and 

20 304 for playing back and recording video images on the 
video cassettes 301 and 302. In this example, the user 
attempts to perform the editing operation shown in Fig. 
IB using the setup of Fig. 1A. 

Fig. IB show the relationship between the 

25 material to be edited and the editing result. In this 

2 
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example, the user plays back scene 505 that is located 
between time t5 and time tlO of the source material, 
scene 506 that is located between time tl3 and t21, and 
scene 507 that is located between time t23 and t25 and 
5 attempts to produce and editing result that is only 
composed of these scenes. 

With the setup of Fig. 1A, the user sets the 
video cassette 301 including the source material into 
the video cassette recorder 303 and the video cassette 

10 302 for recording the editing result into the video 
cassette recorder 304. 

After setting the video cassettes 301 and 302, 
the user presses the fast-forward button on the 
operation panel of the video cassette recorder 303 (as 

15 shown by © in Fig. 1A) to search for the start of scene 
505. Next, the user presses the play button on the 
operation panel of the video cassette recorder 303 (as 
shown by © in Fig. 1A) to reproduce scene 505. At the 
same time, the user presses the record button on the 

20 operation panel of the video cassette recorder 304 (as 
shown by © in Fig. 1A) to commence recording. When 
scene 505 has finished, the user stops the operation of 
both video cassette recorders 303 and 304. The user 
then fast-forwards the video cassette to the start of 

25 scene 506, and then simultaneously commences the 
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playback by video cassette recorder 303 and the 
recording by video cassette recorder 304. After 
completing the above process for scenes 506 and 507, the 
user has the video cassette recorders 303 and 304 
5 respectively rewind the video cassettes 301 and 302 to 
complete the editing operation. 

If the scene linking operation described above 
could be performed with ease at the home, users would 
then be able to easily manage programs that have been 

10 recorded on a large number of magnetic tape cassettes. 

A first problem with the video editing setup 
described above though is that the source material and 
editing result need to be recorded on separate recording 
media, meaning that two video cassette recorders need to 

15 be used for playing back and recording the respective 
recording media. This greatly increases the scale of 
the video editing setup. Since video editing can only 
be performed in a place where it is possible to connect 
two video cassette recorders, this means that a large 

20 space is required to perform the editing operation. 



A second problem with the video editing setup 
described above is that when the user wishes to perform a 
scene linking operation, the user has to repeat the 
25 processes of locating the start of the desired scene and 

4 
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reproducing all of the video images from the beginning to 
the end of the scene for each scene to be linked. Here, the 
larger the number of scenes to be linked, the greater the 
burden of locating the start of each scene and reproducing 
5 the scene, meaning that the complex operations end up taking 
a considerable amount of time. 

When a professional editor performs scene linking, 
instead of producing the editing result in one attempt, it 
is common for the editor to repeatedly amend the scene 
10 linking order so that high-quality results can finally be 
achieved. When using a setup where locating the start and 
reproducing scenes takes so much trouble, it is very 
difficult to perform such repeated amendment of the scene 
linking order. 

15 These problems can be thought of as being caused by 

the use of magnetic tape as the recording medium, so that 
improvements could be made by using a video editing setup 
that utilizes a recording medium which allows random access, 
such as a hard disc or phase change-type optical disc. 

20 As one example, if an optical disc were used to 

store the editing material and the editing result could be 
stored on the same optical disc, video editing would then be 
possible using only one video data editing apparatus that 
uses an optical disc as a recording medium, thereby greatly 

25 reducing the scale of the editing equipment. However, if 
both the editing source material and editing result are 
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stored on the same optical disc, there can be cases when the 
editing result ends up being overwritten over the editing 
source material- In such a case, if the editor later wishes 
to change the editing result, the source material will have 
5 been overwritten, meaning that the editor will not be able 
to redo the editing using the original source materials. 

When the editing source materials are of great 
personal value, such as footage of a child's school entrance 
ceremony, a school sports day, a family holiday, or a 

10 graduation ceremony, the overwriting of the source materials 
denies the user the chance to watch such important events 
again, let alone the chance to re-edit them. If the 
recording medium has a capacity that is greater than double 
the size of the source materials, an editing operation could 

15 presumably be performed without overwriting the source 

materials. However, for phase-change optical discs that are 
the most advanced recording medium, the recording capacity 
is still only 2.6GB on one side, so that it is not possible 
to record video images with a reproduction time of greater 

20 than two hours separately as source materials and editing 
results. Also, if the user wishes to create several 
intermediate versions and record these separately on the 
optical disc to allow the selection of the best one at a 
later date, a recording disc with three or four times the 

25 date size of the audio video data (data produced by 

multiplexing video data and audio data) will be required. 
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It can therefore be readily understood that the storage 
capacity of a single disc is insufficient. 

A third problem with the video editing setup 
described above is that the sections to be linked cannot be 
5 precisely indicated. When performing the editing shown in 
Fig. 1A, the user needs to press the play button of one 
video cassette recorder at the same time as the record 
button on the other video cassette recorder. If the user 
presses one of these buttons before the other, there is the 
10 problem that an undesired part will end up in the recording 
result, or that the editing result will not include the 
start of the desired part. 



DISCLOSURE OF THE INVENTION 

15 It is a first object of the present invention to 

provide an optical disc and a video data editing apparatus 
that uses the optical disc as a recording medium, the 
optical disc enabling video that is already recorded on the 
disc to be edited on the disc itself without overwriting. 

20 It is a second object of the present invention to 

provide an optical disc and a video data editing apparatus 
that uses the optical disc as a recording medium, the 
optical disc being able to store not merely a final result 
of video editing, but also a number of separate intermediate 

25 video editing patterns, with the user being able to select a 
most suitable of the intermediate video editing patterns at 
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a later date. 

It is a third object of the present invention to 
provide an optical disc and a reproduction apparatus for the 
optical disc, the optical disc allowing the user to indicate 
5 sections that are to be subject to editing with high 
precision during the display of video images . 

The first object and second object can be achieved 
by an optical disc, including: a data area for recording a 
file including at least one video object; and an index area 

10 for recording original type chain information and at least 

one set of user-defined type chain information, the original 
type chain information managing the file as an arrangement 
of a plurality of file sections, each file section being 
indicated by section boundaries that are a combination of 

15 any two of - (a) a start position of a video object, (b) an 
end position of a video object, and (c) at least one 
predetermined position within a video object, each set of 
user-defined type chain information indicating a plurality 
of file parts in the file and a reproduction route for the 

20 indicated file parts, each file part being indicating by 

part boundaries, the part boundaries being any of - (a) two 
of the section boundaries, (b) one of the section boundaries 
and a position in a video object that differs from the 
predetermined positions, (c) two positions in a video object 

25 that differ from the predetermined positions, each 

reproduction route being independent of an order in which 
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the indicated file parts appear in the file. 

The original type chain information can be indicate 
that sections are produced in the order in which they are 
arranged in a video object. The user-defined type chain 
5 information can indicate a provisionally decided 

reproduction route for a work produced by video editing of 
the plurality of sections included in a video object. 

The original type chain information can indicate the 
finally determined reproduction order when a video object in 

10 the data area is processed by a video data editing apparatus 
in accordance with a set of user-defined type chain 
information, or when the video object in the data area is 
overwritten. 

The chain information is used during editing 

15 operations, so that the user can soon generate reproduction 
routes for his/her desired sequences of scenes by defining 
sets of user-defined type chain information. By reproducing 
video in accordance with a set of user-defined type chain 
information, the user is able to verify the content of a 

20 provisionally determined reproduction route. 

This provisional determination of a reproduction 
route can be easily performed in a short time by defining a 
set of user-defined type chain information. Since the data 
size of a set of user-defined type chain information is 

25 negligible, there is no risk of the video object being 

accidently overwritten by a set of user-defined type chain 
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information . 

When a disc records video images of great personal 
value, the user may provisionally determine a reproduction 
route for the video on the disc with no danger of the 
5 valuable images being overwritten or lost. 

By defining a plurality of sets of user-defined type 
chain information, by preforming reproduction of each and 
then selecting the best for a real edit, the user can 
perform a bold editing operation that directly rewrites the 

10 content of the video objects on the optical disc. While the 
original video objects will be lost, the user will have had 
ample chance to confirm the result of the real edit and so 
should be satisfied with the result. 

The third object of the present invention can be 

15 achieved by a video data editing apparatus that uses an 
optical disc as an editing medium, the optical disc 
including: a data area for recording a file including at 
least one video object; and an index area for recording 
original type chain information and at least one set of 

20 user-defined type chain information, the original type chain 
information managing the file as an arrangement of a 
plurality of file sections, each file section being 
indicated by section boundaries that are a combination of 
any two of - (a) a start position of a video object, (b) an 

25 end position of a video object, and (c) at least one 

predetermined position within a video object, each set of 

10 
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user-defined type chain information indicating a plurality 
of file parts in the file and a reproduction route for the 
indicated file parts, each file part being indicating by 
part boundaries, the part boundaries being any of - (a) two 
5 of the section boundaries, (b) one of the section boundaries 
and a position in a video object that differs from the 
predetermined positions, (c) two positions in a video object 
that differ from the predetermined positions, each 
reproduction route being independent of an order in which 

10 the indicated file parts appear in the file, the video data 
editing apparatus including: a reception unit for receiving 
an operation from a user; a processing unit for processing 
the file, when the reception means has received an 
indication of a real edit operation for one of the sets of 

15 user-defined type chain information recorded on the optical 
disc, so that starting and ending part boundaries indicated 
in the indicated set of user-defined type chain information 
become boundaries of the video objects recorded on the 
optical disc; and an updating unit for updating, after 

20 processing by the processing unit, the indicated set of 
user-defined type chain information in the index area to 
convert the indicated set of user-defined type chain 
information into the original type chain information. 

Each video object may includes a plurality of video 

25 object units, each video object unit including a plurality 
of sets of picture data that are reproduced for a plurality 
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of video frames for a certain reproduction period, and each 
set of cell information may include: time information 
including a presentation start time information and 
presentation end time information for video data in a cell, 
5 the presentation start time information showing a display- 
video field of a set of picture data that should be 
reproduced first in a part of a video object and the 
presentation end time information showing a display video 
field of a set of picture data that should be reproduced 

10 last in the part, and the cell being one of a file section 
and a file part; and identification information for 
indicating the video object to which the cell belongs, 
wherein the reception unit may receive a play indication for 
one of the sets of user-defined type chain information and 

15 wherein the video data editing apparatus may further 

include: an access position specifying unit for reading, 
when the reception unit has received a play indication, 
mapping information and a pair of presentation start time 
information and presentation end time information from the 

20 index area, and for specifying, by searching the mapping 
information using the presentation start time information 
and the presentation end time information, a recording 
position of a start video object unit including picture data 
that is displayed for the presentation start time 

25 information and a recording position of an end video object 
unit including picture data that is displayed for the 
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presentation end time information; and a reading unit for 
reading a video object unit sequence recorded between the 
specified recording positions; and a decoding unit for 
decoding the read video object unit sequence and, for 
5 outputting, when part boundaries of a part corresponding to 
a set of cell information in the indicated set of user- 
defined type chain information do not match a start position 
of the start video object unit and an end position of the 
end video object unit sequence, a decoding result between a 

10 first video field and an end video field indicated by the 
cell information, and for prohibiting output of a decoding 
result of data before the first video field and data after 
the end video field. 

With the stated construction, the cell information 

15 specifies the video parts to be used in the editing to an 
accuracy of one video field. This means that the parts to 
be edited can be specified with very high precision. 



20 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects, advantages and features 
of the invention will become apparent from the following 
description thereof taken in conjunction with the 
25 accompanying drawings which illustrate a specific 
embodiment of the invention. In the drawings: 

13 
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Fig. 1A shows a convention video editing setup 
using video cassette recorders that are capable of 
playing back and recording video signals; 

Fig. IB shows the relationship between the source 
5 materials and the editing result; 

Fig. 2A shows the outward appearance of a DVD-RAM 
disc that is the recordable optical disc used in the 
embodiments of the present invention; 

Fig. 2B shows the recording areas on a DVD-RAM; 
10 Fig. 2C shows the cross-section and surface of a 

DVD-RAM cut at a sector header; 

Fig. 3A shows the zones 0 to 23 on a DVD-RAM; 

Fig. 3B shows the zones 0 to 23 arranged into a 
horizontal sequence; 
15 Fig. 3C shows the logical sector numbers (LSN) in 

the volume area; 

Fig. 3D shows the logical block numbers (LBN) in 
the volume area; 

Fig. 4A shows the contents of the data recorded 
20 in the volume area; 

Fig. 4B shows the hierarchical structure of the 
data definitions used in MPEG standard; 

Fig. 5A shows a plurality of sets of picture data 
arranged in display order and a plurality of sets of 
25 picture data arranged in coding order; 

14 
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Fig. 5B shows the correspondence between audio 
frames and audio data; 

Fig. 6A shows a detailed hierarchy of the logical 
formats in the data construction of a VOB (Video 
5 Object) ; 

Fig. 6B shows the partial deletion of a VOB; 

Fig. 6C shows the logical format of a video pack 
arranged at the start of a VOB; 

Fig. 6D shows logical format of other video packs 
10 arranged in a VOB; 

Fig. 6E shows the logical format of an audio 

pack; 

Fig. 6F shows the logical format of a pack 

header; 

15 Fig. 6G shows the logical format of a system 

header; 

Fig. 6H shows the logical format of a packet 

header; 

Fig. 7A shows a video frame and the occupancy of 
20 the video buffer; 

Fig. 7B shows an audio frame and an ideal 
transition in the buffer state of the audio buffer; 

Fig. 7C shows an audio frame and the actual 
transition in the buffer state of the audio buffer; 
25 Fig. 7D shows the detailed transfer period of 

15 
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each set of picture data; 

Fig. 8A shows how audio packs, which store the 
audio data to be reproduced in a plurality of audio 
frames, and video packs, which store the picture data 
5 that is to be reproduced in a plurality of video frames, 
may be recorded; 

Fig. 8B shows a key to the notation used in Fig. 

8A; 

Fig. 9 shows how audio packs, which store the 
10 audio data to be reproduced in a plurality of audio 

frames, and video packs, which store the picture data 
that is to be reproduced in a plurality of video frames, 
may be recorded; 

Fig. 10A shows the transition in the buffer state 
15 during for the first part of a video stream; 

Fig. 10B shows the transition in the buffer state 
during for the last part of a video stream; 

Fig. 10C shows the transition in the buffer state 
across two VOBs, when the video stream whose last part 
20 causes the buffer state shown in Fig. 10B is seamlessly 
linked to the video stream whose former part causes the 
buffer state shown in Fig. 10A; 

Fig. 11A is a graph where the SCRs of video packs 
included in a VOB are plotted in the order in which the 
25 video packs are arranged; 
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Fig. 11B shows an example where the first SCR in 
section B matches the last SCR in section A; 

Fig. 11C shows an example where the first SCR in 
section D is higher than the last SCR in section C; 
5 Fig. 11D shows an example where the last SCR in 

section E is higher than the first SCR in section F; 

Fig. HE shows the graph for the continuity of 
VOBs of Fig. HA for two specific VOBs; 

Fig. 12A shows a detailed expansion of the data 
10 hierarchy in the RTRW management file; 

Fig. 12B shows the PTM descriptor format; 

Fig. 12C shows the data construction of the audio 
gap location information; 

Fig. 13 shows the buffer occupancy for each of a 
15 former VOB and a latter VOB; 

Fig. 14A shows examples of audio frames and video 

frames; 

Fig. 14B shows the time difference gl that 
appears at the end of the audio data and picture data 
20 when the reproduction time of picture data and the 

reproduction time of audio data are aligned at the start 
of a VOB; 

Fig. 14C shows the audio pack G3 including the 
audio gap and the audio pack G4, audio pack G3 including 
25 (i) the sets of audio data y-2, y-1, and y, which are 

17 
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located at the end of VOB#l, and (ii) the 
Padding_Packet , and audio pack G4 including the sets of 
audio data u, u+1, and u+2, which are located at the 
start of V0B#2; 

5 Fig. 14D shows into which of VOBU#l, V0BU#2, and 

V0BU#3 at the start of the V0B#2 the audio pack G3 
including the audio gap is arranged; 

Figs. 15A to 15D show the procedure for the 
regeneration of the audio gap when the VOBUs located at 
10 the start of VOB#2, out of the VOBs #1 and #2 that are 
to be reproduced seamlessly, are deleted; 

Fig. 16 show an example system configuration 
using the video data editing apparatus of the first 
embodiment; 

15 Fig. 17 is a block diagram showing the hardware 

construction of the DVD recorder 70; 

Fig. 18 shows the construction the MPEG encoder 

2; 

Fig. 19 shows the construction of the MPEG 
20 decoder 4; 

Fig. 20 is a timing chart showing the timing for 
the switching of switches SW1 to SW4; 

Fig. 21 is a flowchart showing the procedure of 
the seamless processing; 
25 Fig. 22 is also a flowchart showing the procedure 
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of the seamless processing; 

Figs. 23A and 23B show the analysis of transition 
in the buffer state for audio packs; 

Fig. 23C shows the area that is to be read from 
5 the former VOB in step S106; 

Fig. 23D shows the area that is to be read from 
the latter VOB in step S107; 

Fig. 24A shows the audio frames in the audio 
stream that correspond to the audio frames x, x+1, y, u, 
10 u+1, u+2 used in Fig. 22; 

Fig. 24B shows the case when the 
First_SCR+STC_of f set corresponds to a boundary between 
audio frames in the former VOB; 

Fig. 24C shows the case when the video 
15 reproduction start time VOB_VJS_PTM+STC_of f set 

corresponds to a boundary between audio frames in the 
former VOB; 

Fig. 24D shows the case when the presentation end 
time of the video frame y corresponds to a boundary 
20 between audio frames in the latter VOB; 

Fig. 25 shows how the audio packs storing audio 
data for a plurality of audio frames and the video packs 
storing video data for each video frame are multiplexed; 

Fig. 2 6 shows an example of the section of a VOB 
25 that is specified using time information for a pair of 

19 
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C_V__S_PTM and C_VJE_PTM; 

Fig. 27A shows the area that is to be read from 
the former cell in step S106; 

Fig. 27B shows the area that is to be read from 
5 the latter cell in step S107; 

Fig. 28A shows an example of the linking of sets 
of cell information that are specified as the editing 
boundaries in a VOBU; 

Fig. 28B shows the processing for the three rules 
10 for reconstructing GOPs when correcting the display 
order and coding order; 

Fig. 2 9A shows the processing when changing a 
picture type of picture data in the former cell; 

Fig. 2 9B shows the procedure for measuring the 
15 change 3 in the buffer occupancy when changing a picture 
type in the former cell; 

Fig. 30A shows the processing where changing the 
picture type of the latter cell; 

Fig. 30B shows the procedure for measuring the 
20 change a in the buffer occupancy when changing a picture 
type in the latter cell; 

Fig. 31 is a flowchart showing the procedure for 
the seamless processing; 

Fig. 32 is also a flowchart showing the procedure 
25 for the seamless processing; 

20 
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Fig. 33 is also a flowchart showing the procedure 
for the seamless processing; 

Fig. 34 shows the audio frames in the audio 
stream that correspond to the audio frames x, x+1, and y 
5 used in the flowchart of Fig. 31; 

Fig. 35 shows the hierarchical directory 
structure; 

Fig. 36 shows the information, aside from the 
sector management table and AV block management table 
10 shown in Fig. 6, in the management information for the 
file system; 

Fig. 37 shows the linked relations shown by the 
arrows in Fig. 6 within the directory structure; 

Fig. 38A shows the data construction of file 
15 entries in greater detail; 

Fig. 38B shows the data construction of the 
allocation descriptors; 

Fig. 38C shows the recorded state of the upper 2 
bits in the data shows the extent length; 
20 Fig. 39A shows the detailed data construction of 

the file identification descriptor for a directory; 

Fig. 39B shows the detailed data construction of 
the file identification descriptor for a file; 

Fig. 40 is a model showing the buffering in the 
25 track buffer of AV data read from the DVD-RAM; 

21 
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Fig. 41 is a functional block diagram showing the 
construction of the DVD recorder 70 divided by function; 

Fig. 42 shows an example of an interactive screen 
displayed on the TV monitor 72 under the control of the 
5 recording-editing-reproduction control unit 12; 

Fig. 43 is a flowchart showing the processing by 
the recording-editing-reproduction control unit 12 for a 
virtual edit and for a real edit; 

Figs. 4 4A to 44F show a supplementary example to 
10 illustrate the processing of the AV data editing unit 15 
in the flowchart of Fig. 43; 

Figs. 45A to 45E show a supplementary example to 
illustrate the processing of the AV data editing unit 15 
in the flowchart of Fig. 43; 
15 Figs. 4 6A to 4 6F show a supplementary example to 

illustrate the processing of the AV data editing unit 15 
in the flowchart of Fig. 43; 

Fig. 47A shows the relationship between the 
extents and the in-memory data, in terms of time; 
20 Fig. 47B shows the positional relationship 

between the extents, the In area and the Out area; 

Fig. 48A is a flowchart showing the processing by 
the AV file system unit 11 when executing a "SPLIT" 
command; 

25 Fig. 48B is a flowchart showing the processing 
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when executing a "SHORTEN" command is issued; 

Fig. 48B is a flowchart showing the processing 
when executing a "MERGE" command is issued; 

Fig. 50 is a flowchart for the case when the 
former extent is below AV block length but the latter 
extent is at least equal to AV block length; 

Figs. 51A-51B are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 50; 

Figs. 52A to 52C are a supplementary example 
showing the processing of the AV file system unit 11 in 
the flowchart of Fig. 50; 

Figs. 53A to 53D are a supplementary example 
showing the processing of the AV file system unit 11 in 
the flowchart of Fig. 50; 

Figs. 54A-54D are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 50; 

Fig. 55 is a flowchart for the case when the 
former extent is at least equal to AV block length but 
the latter extent is below AV block length; 

Figs. 56A-56B are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 55; 

Figs. 57A-57C are a supplementary example showing 
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the processing of the AV file system unit 11 in the 
flowchart of Fig. 55; 

Figs. 53A-53D are a supplementary example showing 
the processing of the AV file system unit 11 in the 
5 flowchart of Fig. 55; 

Figs. 54A-54D are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 55; 

Fig. 60 is a flowchart for the case when the both 
10 the former extent and the latter extent are below AV 
block length; 

Figs. 61A-61D are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 60; 
15 Figs. 62A-62C are a supplementary example showing 

the processing of the AV file system unit 11 in the 
flowchart of Fig. 60; 

Figs. 63A-63C are a supplementary example showing 
the processing of the AV file system unit 11 in the 
20 flowchart of Fig. 60; 

Figs. 64A-64D are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 60; 

Fig. 65 is a flowchart for the case when the both 
25 the former extent and the latter extent are at least 
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equal to AV block length; 

Figs. 66A-66D are a supplementary example showing 
the processing of the AV file system unit 11 in the 
flowchart of Fig. 65; 
5 Fig. 67 is a flowchart showing the case when the 

both the former extent and the latter extent are at 
least equal to AV block length but the data sizes of the 
In area and Out area are insufficient; 

Figs. 68A-68E are a supplementary example showing 
10 the processing of the AV file system unit 11 in the 
flowchart of Fig. 67; 

Figs. 69A-69D are a supplementary example showing 
the processing of the def ragmentation unit 16; 

Fig. 70A shows the detailed hierarchical content 
15 of the RTRW management file in the fourth embodiment; 

Fig. 70B is a flowchart showing the logical 
format of the original PGC information in the fourth 
embodiment; 

Fig. 70C is a flowchart showing the logical 
20 format of the user-defined PGC information in the fourth 
embodiment; 

Fig. 70D shows the logical format of the title 
search pointer; 

Fig. 71 shows the inter-relationships between the 
25 AV file, the extents, the VOBs, the VOB information, the 

25 
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original PGC information, and the user-defined PGC 
information, with the unified elements being enclosed in 
the frames drawn with the heavy lines; 

Fig. 72 shows an example of a user-defined PGC 
5 and an original PGC; 

Fig, 73 shows the part that corresponds to the 
cell to be deleted using diagonal shading; 

Fig. 74A shows which ECC blocks are freed into 
empty areas by a real edit using the user-defined PGC 
10 information #2; 

Fig. 74B shows examples of VOBs, VOB information, 
and PGC information after a real edit; 

Fig. 75 is a functional block diagram shown the 
construction of the DVD recorder 70 divided according to 
15 function; 

Fig. 7 6 shows an example of original PGC 
information that has been generated by the user-defined 
PGC information generator 25 when recording an AV file; 

Fig. 77A shows an example of graphics data that 
20 is displayed on the TV monitor 72 under the control of 
the recording-editing-reproduction control unit 12: 

Fig. 77B shows an example of the PGC information 
and cell information that are displayed as a list of 
operation targets ; 
25 Fig. 78A is a flowchart shows the processing 
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during partial reproduction of a title; 

Fig. 78B shows how only the section between the 
presentation start time C_V_S_PTM and the presentation 
end time C_V_E_PTM is reproduced, out of the VOBUs 
5 between the VOBU (START) and the VOBU (END) ; 

Figs. 7 9A, 7 9B show the user pressing the mark 
key while viewing video images on the TV monitor 72; 

Figs. 80A, 80B show how data is inputted and 
outputted between the components shown in Fig. 75 when a 
10 marking operation is performed; 

Fig. 81 is a flowchart showing the processing of 
the editing multi-stage control unit 2 6 when defining 
user-defined PGC information; 

Fig. 82 is a flowchart showing the processing of 
15 the editing multi-stage control unit 2 6 when defining 
user-defined PGC information; 

Fig. 83 is a flowchart showing the processing of 
the recording-editing-reproduction control unit 12 
during a preview and a real edit; 
20 Fig. 84 is a flowchart showing the update 

processing for the PGC information after a real edit; 

Fig. 8 5 shows an example of the interactive 
screen that is displayed on the TV monitor 72 to have 
the user make a selection of cell information as a 
25 element in a set of user-defined PGC information during 
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a virtual edit; 

Figs. 8 6A, 8 6B show the relationship between the 
user operation of the remote controller 71 and the 
display processing that accompanies the user operation; 
5 Figs. 87A to 87D show the relationship between 

the user operation of the remote controller 71 and the 
display processing that accompanies the user operation; 

Figs. 88A, 88B show the relationship between the 
user operation of the remote controller 71 and the 
10 display processing that accompanies the user operation; 

Figs. 89A, 89B show the relationship between the 
user operation of the remote controller 71 and the 
display processing that accompanies the user operations- 
Fig. 90 shows an example of the interactive 
15 screen that has the user select a set of user-defined 
PGC information or a preview (using the play key) or a 
real edit (using the real edit key) ; 

Fig. 91 shows an example of the original PGC 
information table and user-defined PGC information 
20 table, when the user-defined PGC information #2 composed 
of CELL#2B, CELL#4B, CELL#10B, and CELL#5B and the user- 
defined PGC information #3 composed of CELL#3C, CELL#6C, 
CELL#8C, CELL#9C have been defined; 

Figs. 92A-92B show the relationship between the 
25 user operation of the remote controller 71 and the 
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display processing that accompanies the user operation; 

Figs. 93A-93C show the relationship between the 
user operation of the remote controller 71 and the 
display processing that accompanies the user operation; 
5 Figs. 94A-94C show the relationship between the 

user operation of the remote controller 71 and the 
display processing that accompanies the user operation; 
and 

Fig. 95 shows the original PGC information table 
10 and the user-defined PGC information table after the 
processing of VOBs in a real edit. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 
15 The following embodiments describe a video data 

editing apparatus and the optical disc which the video 
data editing apparatus uses as recording medium. For 
ease of explanation, the explanation is divided into 
four embodiments that deal with the physical structure 
20 of the optical disc, the logical structure, the hardware 
structure of the video data editing apparatus, and the 
functional construction of the video data editing 
apparatus . 

The first embodiment explains the physical 
25 structure of the optical disc and the hardware structure 
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of the video data editing apparatus, as well as the 
seamless linking of video objects as the first basic 
example of video editing. 

The second embodiment explains seamless linking 
5 of partial sections of video objects as the second basic 
example. The third embodiment deals with the functional 
construction of the video data editing apparatus and the 
procedure for realizing video editing within a file 
system. 

10 The fourth embodiment describes the data 

structures and procedure of the video data editing 
apparatus when performing a two-stage editing process 
composed of virtual editing and real editing of two 
types of program chain called a user-defined PGC and an 

15 original PGC. 

(1-D Physical Structure of a Recordable Optical Disn 

Fig. 2A shows the external appearance of a 
DVD-RAM disc that is a recordable optical disc. As 

20 shown in this drawing, the DVD-RAM is loaded into a 

video data editing apparatus having been placed into a 
cartridge 75. This cartridge 75 protects the recording 
surface of the DVD-RAM, and has a shutter 7 6 which opens 
and closes to allow access to the DVD-RAM enclosed 

25 inside. 
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Fig. 2B shows the recording area of DVD-RAM disc 
which is a recordable optical disc. As shown in the 
figure, the DVD-RAM has a lead-in area at its innermost 
periphery and a lead-out area at its outermost 
5 periphery, with the data area in between. The lead-in 
area records the necessary reference signals for the 
stabilization of a servo during access by an optical 
pickup, and identification signals to prevent confusion 
with other media. The lead-out area records the same 

10 type of reference signals as the lead-in area. The data 
area, meanwhile, is divided into sectors which are the 
smallest unit by which the DVD-RAM can be accessed. 
Here, the size of each sector is set at 2KB. 

Fig. 2C shows the cross-section and surface of a 

15 DVD-RAM cut at the header of a sector. As shown in the 
figure, each sector is composed of a pit sequence that 
is formed in the surface of a reflective film, such as a 
metal film, and a concave-convex part. 

The pit sequence is composed of 0 . 4yim~l . 87pm pits 

20 that are carved into the surface of the DVD-RAM to show 
the sector address. 

The concave-convex part is composed of a concave 
part called a "groove" and a convex part called a 
"land". Each groove and land has a recording mark 

25 composed of a metal film capable of phase change 



WO 99/14935 



PCT/JP98/04174 



attached to its surface. Here, the expression "capable 
of phase change" means that the recording mark can be in 
a crystalline state or a non-crystalline state depending 
on whether the metal film has been exposed to a light 
5 beam. Using this phase change characteristic, data can 
be recorded into this concave-convex part. While it is 
only possible to record data onto the land part of an MO 
(Magnetic-Optical) disc, data can be recorded onto both 
the land and the groove parts of a DVD-RAM, meaning that 

10 the recording density of a DVD-RAM exceeds that of an MO 
disc. Error correction information is provided on a 
DVD-RAM for each group of 16 sectors. In this 
specification, each group of 16 sectors that is given an 
ECC (Error Correcting Code) is called an ECC block. 

15 On a DVD-RAM, the data area is divided to several 

zones to realize rotation control called 
Z-CLV (Zone-Constant Linear Velocity) during recording 
and reproduction. 

Fig. 3A shows the plurality of zones provided on 

20 a DVD-RAM. As shown in the figure, a DVD-RAM is divided 
to 24 zones numbered zone 0-zone 23. Each zone is a 
group of tracks that are accessed using the same angular 
velocity. In this embodiment, each zone includes 1888 
tracks. The rotational angular velocity of the DVD-RAM 

25 is set separately for each zone, with this velocity 
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being higher the closer a zone is located to the inner 
periphery of the disc. Division of the data area into 
zones ensures that the optical pickup can move at a 
constant velocity while performing access within a 
5 single zone. By doing so, the recording density of 

DVD-RAM is raised, and rotation control during recording 
and reproduction is made easier. 

Fig. 3B shows a horizontal arrangement of the 
lead-in area, the lead-out area, and the zones 0-23 that 

10 are shown in Fig. 3A. 

The lead-in area and lead-out area each include a 
defect management area (DMA: Defect Management Area) . 
This defect management area records position information 
showing the positions of sectors found to include 

15 defects and replacement position information showing 

whether the sectors used for replacing defective sectors 
are located in any of the replacement areas. 

Each zone has a user area, in addition to a 
replacement area and an unused area that are provided at 

20 the boundary with the next zone. A user area is an area 
that the file system can use as a recording area. The 
replacement area is used to replace defective sectors 
when such defective sectors are found. The unused area 
is an area that is not used for recording data. Only 

25 two tracks are used as the unused area, with such unused 
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area being provided to prevent mistaken identification 
of sector addresses. The reason for this is that while 
sector addresses are recorded at a same position in 
adjacent tracks within the same zone, for Z-CLV the 
5 recording positions of sector addresses are different 
for adjacent tracks at the boundaries between zones. 

In this way, sectors which are not used for data 
recording exist at the boundaries between zones. On a 
DVD-RAM, logical sector numbers (LSN: Logical Sector 

10 Number) are assigned to physical sectors of the user 
area in order starting from the inner periphery to 
consecutively show only the sectors used for recording 
data. As shown in Fig. 3C, the area that records user 
data and is composed of sectors that have been assigned 

15 an LSN is called the volume area. 

The volume area is used for recording AV files 
that are each composed of a plurality of VOBs and an 
RTRW (RealTime Rewritable) management file that is the 
management information for the AV files. These AV files 

20 and RTRW management file are in fact recorded in a file 
system according to ISO/IEC 13346, although this will 
not be explained in the present embodiment. The file 
system is dealt with in detail in the third embodiment 
below. 
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(1-2) Data Recorded in the Volume Area 

Fig. 4A shows the content of the data recorded in 
the volume area of a DVD-RAM. 

The video stream and audio stream shown on the 
5 fifth level of Fig. 4A are divided into units of around 
2KB, as shown on the fourth level. The units obtained 
through this division are interleaved into VOB#l and 
V0B#2 in the AV file shown on the third level as video 
packs and audio packs in compliance with MPEG standard. 

10 The AV file is split into a plurality of extents as 

shown on the second level, in compliance with ISO/IEC 
1334 6, and these extents each being stored in an empty 
area within one zone in the volume area, as shown on the 
first level of Fig. 4A. 

15 Information for VOB#l~VOB#3 is recorded in an 

RTRW management file as the VOB#l information, VOB#2 
information, and VOB#3 information shown on the fifth 
level. In the same way as an AV file, this RTRW file is 
divided into a plurality of extents that are recorded in 

20 empty areas in the volume area. 

The following explanation will deal with video 
streams, audio streams, and VOBs separately, having 
first explained the hierarchical structure of MPEG 
standard and DVD-RAM standard which define the data 

25 structures of these elements. 
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Fig. 4B shows the hierarchical structure of the 
data definitions used under MPEG standard. The data 
structure for MPEG standard is composed of an elementary 
stream layer and a system layer. 

The elementary stream layer shown in Fig. 4B 
includes a video layer that defines the data structure 
of video streams, an MPEG-Audio layer that defines the 
data structure of an MPEG-Audio stream, an AC 3 layer 
that defines the data structure of an audio stream under 
Dolby- AC3 methods, and a Linear-PCM layer that defines 
the data structure of an audio stream under Linear-PCM 
methods. The presentation start time 
(Presentation_Start_Time) and presentation end time 
(Presentation_End_Time) are defined within the 
elementary stream layer, though, as shown by the 
separate boxes used for the video layer, MPEG-Audio 
layer, AC-3 layer, and Linear-PCM layer, the data 
structures of the video stream and the audio stream are 
independent of each other. The presentation start time 
and presentation end time of a video frame and the 
presentation start time and presentation end time of an 
audio frame are similarly not synchronized. 

The system layer shown in Fig. 4B defines the 
packs, packets, DTS and PTS that are described later. 
In Fig. 4B, the system layer is shown in a separate box 
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to the video layer and audio layer, showing that the 
packs, packets, DTS and PTS are independent of the data 
structures of the video streams and audio streams. 

While the above layer structure is used for MPEG 
5 standard, DVD-RAM standard includes the system layer 

under MPEG standard shown in Fig. 4B and an elementary 
stream layer. In addition to the packs, packets, DTS, 
and PTS described above, DVD standard defines the data 
structures of the VOBs shown in Fig. 4A. 

10 

(1-2-1) Video Stream 

The video stream shown in Fig. 5A has a data 
structure that is defined by the video layer shown in 
Fig. 4B. Each video stream is composed of an 

15 arrangement of a plurality of sets of picture data that 
each correspond to one frame of video images. This 
picture data is a video signal according to NTSC 
(National Television Standards Committee) or PAL (Phase- 
Alternation Line) standard that has been compressed 

20 using MPEG techniques. Sets of picture data produced by 
compressing a video signal under NTSC standard are 
displayed by video frames that have a frame interval of 
around 33msec (1/29.97 seconds to be precise), while 
sets of picture data produced by compressing a video 

25 signal under PAL standard are displayed by video frames 
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that have a frame interval of 4 0msec. The top level of 
Fig. 5A shows examples of video frames. In Fig. 5A, the 
sections indicated between the "<" and ">" symbols are 
each a video frame, with the "<" symbol showing the 
5 presentation start time ( Presentation_Start__Time ) for 
each video frame and the ">" symbol showing the 
presentation end time ( Presentat ion_End__Time ) for each 
video frame. This notation for video frames is also 
used in the following drawings. The sections which are 

10 enclosed by these symbols each include a plurality of 
video fields. 

As shown in Fig. 5A, the picture data that should 
be displayed for a video frame is inputted into a 
decoder before the Presentation_Start_Time of the video 

15 frame and must be taken from the buffer by the decoder 
at the Presentation_Start_Time . 

When compression is performed in accordance with 
MPEG standards , the spatial frequency characteristics 
within the image of one frame and the time-related 

20 correlation with images that are displayed before or 

after the one frame are used. By doing so, each set of 
picture data is converted into one of a Bidirectionally 
Predicative (B) Picture, a Predicative (P) Picture, or 
an Intra (I) Picture. A B picture is used where 

25 compression is performed using the time-related 
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correlation with images that are reproduced both before 
and after the present image. A P picture is used where 
compression is performed using the time-related 
correlation with images that are reproduced before the 
5 present image. An I picture is used where compression 

is performed using the spatial frequency characteristics 
within one frame without using time-related correlation 
with other images. Fig. 5A shows B pictures, P 
pictures, and I pictures as all having the same size, 

10 although it should be noted that there is in fact great 
variation in their sizes. 

When decoding a B picture or a P picture that use 
the time-related correlation between frames, it is 
necessary to refer to the images that are to be 

15 reproduced before or after the picture being decoded. 

For example, when decoding a B picture, the decoder has 
to wait until the decoding of the following image has 
been completed. 

As a result, an MPEG video stream defines the 

20 coding order of each picture in addition to defining the 
display order of the pictures. In Fig. 5A, the second 
and third levels respectively show the sets of picture 
data arranged in display order and in coding order. 

In Fig. 5A, the reference target of one of the B 

25 pictures is shown by the broken line to be the following 
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I picture. In the display order, this I picture follows 
the B picture, though since the B picture is compressed 
using time-related correlation with the I picture, the 
decoding of the B picture has to wait for the decoding 
5 of the I picture to be completed. As a result, the 

coding order defines that the I picture comes before the 
B picture. This rearranging of the display order of 
pictures when generating the coding order is called 
"reordering" . 

10 As shown on the third level of Fig. 5A, each set 

of picture data is divided into 2KB units after being 
arranged into the coding order. The resulting 2KB units 
are stored as a video pack sequence, as shown on the 
bottom level of Fig. 5A. 

15 When a sequence of B pictures and P pictures is 

used, problems can be caused, such as by special 
reproduction features that perform decoding starting 
midway through the video stream. To prevent such 
problems, an I picture is inserted into the video data 

20 at 0.5s intervals. Each sequence of picture data 

starting from an I picture and continuing as far as the 
next I picture is called a GOP (Group Of Pictures) , with 
GOPs being defined in the system layer of MPEG standard 
as the unit for MPEG compression. On the third level of 

25 Fig. 5A, the dotted vertical line shows the boundary 
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between the present GOP and the following GOP. In each 
GOP, the picture type of the picture data that is 
arranged last in the display order is a P picture, while 
the picture type of the picture data that is arranged 
5 first in the coding order must be an I picture. 

M-2-2) Audio Stream 

The audio stream is data that has been compressed 
according to one of Dolby-AC3 method, MPEG method, and 

10 Linear-PCM. Like a video stream, an audio stream is 
generated using audio frames that have a fixed frame 
interval. Fig. 5B shows the correspondence between the 
audio frames and audio data. In detail, the 
reproduction period of an audio frame is 32msec for 

15 Dolby-AC3, 24msec for MPEG, and around 1.67msec 
(l/600sec to be precise) for Linear-PCM. 

The top level of Fig. 5B shows example audio 
frames. In Fig. 5B, each section indicated between the 
"<" and ">" symbols is an audio frame, with the "<" 

20 symbol showing the presentation start time and the ">" 

symbol showing the presentation end time. This notation 
for video frames is also used in the following drawings. 
The audio data that should be displayed for an audio 
frame is inputted into a decoder before the presentation 

25 start time of the audio frame and should be taken out of 
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the buffer by the decoder at the presentation start 
time . 

The bottom level of Fig. 5B shows an example of 
how the audio data to be reproduced in each frame is 
5 stored in audio packs. In this figure, the audio data 
to be reproduced for audio frames f81, f82 is stored in 
audio pack A71, the audio data to be reproduced for 
audio frame f84 is stored in audio pack A72, and the 
audio data to be reproduced for audio frames f86, f87 is 

10 stored in audio pack A73. The audio data to be 

reproduced for audio frame f83 is divided between the 
audio pack A71 that comes first and the audio pack A72 
which comes later. In the same way, the audio data to 
be reproduced for audio frame f86 is divided between the 

15 audio pack A72 that comes first and the audio pack A73 
which comes later. The reason the audio data to be 
reproduced for one audio frame is stored divided between 
two audio packs is that the boundaries between audio 
frames and video frames do not match the boundaries 

20 between packs. The reason that such boundaries do not 
match is that the data structure of packs under MPEG 
standard is independent of the data structure of video 
streams and audio streams. 

25 1-2-3 Dat a Structure of VOBs 
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The VOBs (Video Objects) #1, #2, #3 ... shown in 
Fig. 4A are program streams under ISO/IEC 13818-1 that 
are obtained by multiplexing a video stream and audio 
stream, although these VOBs do not have a 
5 program_end__code at the end. 

Fig. 6A shows the detailed hierarchy for the 
logical construction of VOBs. This means that the 
logical format located on the highest level of Fig. 6A 
is shown in more detail in the lower levels. 

10 The video stream that is located on the highest 

level in Fig. 6A is shown divided into a plurality of 
GOPs on the second level, with these GOPs having been 
shown in Fig. 5A. As in Fig. 5A, the picture data in 
GOP units is divided into a large number of 2KB units. 

15 On the other hand, the audio stream shown on the left of 
the highest level in Fig. 6A is divided into a large 
number of approximately 2KB units on the third level in 
the same way as in Fig. 5B. The picture data for a GOP 
unit that is divided into 2KB units is interleaved with 

20 the audio stream that is similarly divided into 
approximately 2KB units. This produces the pack 
sequence on the fourth level of Fig. 6A. This pack 
sequence forms a plurality of VOBUs (Video Object Units) 
that are shown on the fifth level, with the VOB (Video 

25 Object) shown on the sixth level being composed of a 
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plurality of these VOBUs arranged in a time series. In 
Fig. 6 A , the guidelines drawn using broken lines show 
the relations between the data in the data structures on 
adjacent levels. By referring to the guidelines in Fig. 
5 6A, it can be seen that the VOBUs on the fifth level 

correspond to the pack sequence on the fourth level and 
the picture data in GOP units shown on the second level. 

As can be seen by tracing the guidelines, each 
VOBU is a unit that includes at least one GOP composed 

10 of picture data with a reproduction period of around 0.4 
to 1.0 second and audio data that has been interleaved 
with this picture data. At the same time, each VOBU is 
composed of an arrangement of video packs and audio 
packs under MPEG standard. The unit called a GOP under 

15 MPEG standard is defined by the system layer, although 
when only video data is specified by a GOP, as shown on 
the second level of Fig. 6A, the audio data and other 
data (such as sub-picture data and control data) that is 
multiplexed with the video data is not indicated by the 

20 GOP. Under DVD-RAM standard, the expression "VOBU" is 
used for a unit that corresponds to a GOP, with this 
unit being a general name for at least one GOP composed 
of picture data with a reproduction period of around 0.4 
to 1.0 second and the audio data that has been 

25 interleaved with this picture data. 
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Here, it is possible for parts of a VOB to be 
deleted, with the minimum unit being one VOBU. As one 
example, the video stream recorded on a DVD-RAM as a VOB 
may contain images for a commercial that are not wanted 
5 by the user. The VOBUs in this VOB include at least one 
GOP that composes the commercial and audio data that is 
interleaved with this picture data, so that if only the 
VOBUs in the VOB that correspond to the commercial can 
be deleted, the user will then be able to watch the 

10 video stream without having to watch the commercial. 
Here, even if one VOBU is deleted, for example, the 
VOBUs on either side of the deleted VOBU will include a 
part of the video stream in GOP units that each have an 
I picture located at their front. This means that a 

15 normal decode and reproduction process are possible, 
even after the deletion of the VOBU. 

Fig. 6B shows an example where part of a VOB is 
deleted. This VOB originally includes VOBU#l, VOBU#2, 
VOBU#3, VOBU#4 VOBU#7 . When the deletion of 

20 VOBU#2, VOBU#4, and VOBU#6 is indicated, the areas that 
were originally occupied by these VOBUs are freed and so 
are shown as empty areas on the second level of Fig. 6B. 
When the VOB is reproduced thereafter, the reproduction 
order is VOBU#l, VOBU#3, VOBU#5, and VOBU#7 . 

25 The video packs and audio packs included in a 
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VOBU each have data length of 2KB. This 2KB size 
matches the sector size of a DVD-RAM, so that each video 
pack and audio pack is recorded in a separate sector. 

The arrangement of video packs and audio packs is 
5 corresponds to the arrangement of an equal number of 
consecutive logical sectors, and the data held within 
these packs is read from the DVD-RAM. This is to say, 
the arrangement of video packs and audio packs refers to 
the order in which these packs are read from the DVD- 
10 RAM. Since each video pack is approximately 2KB in 

size, if the data size of the video stream for one VOBU 
is several hundred KB, for example, the video stream 
will be stored having been divided into several hundred 
video packs . 

15 

(1-2-3-1) Data Structure of Video Packs and Audio Packs 

Figs. 6C to 6E show the logical format of the 
video packs and audio packs stored in a VOBU. Normally, 
a plurality of packets are inserted into one pack in an 

20 MPEG system stream, although under DVD-RAM standard, the 
number of packets that may be inserted into one pack is 
restricted to one. Fig. 6C shows the logical format of 
a video pack arranged at the start of a VOBU. As shown 
in Fig. 6C, the first video pack in a VOBU is composed 

25 of a pack header, a system header, a packet header, and 
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video data that is part of the video stream. 

Fig. 6D shows the logical format of the video 
packs that do not come first in the VOBU. As shown in 
Fig. 6D, these video packs are each composed of a pack 
5 header, a packet header, and video data, with no system 
header . 

Fig. 6E shows the logical format of the audio 
packs. As shown in Fig. 6E, each audio pack is composed 
of a pack header, a packet header, a sub_stream_id 
10 showing whether the compression method used for the 

audio stream included in the present pack is Linear-PCM 
or Dolby-AC3, and audio data that is part of the audio 
stream and has been compressed according to the 
indicated method. 

15 

(1-2-3-2-1) Buffer Control within a VOB 

The video stream and audio stream are stored in 
video packs and audio packs as described above. 
However, in order to seamlessly reproduce VOBs, it is 

20 not sufficient to store the video stream and audio 

stream in video packs and audio packs, with it being 
necessary to suitably arrange of video packs and audio 
packs to ensure that buffer control will be 
uninterrupted. The buffers referred to here are input 

25 buffers for temporarily storing the video stream and the 
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audio stream before input into a decoder. Hereinafter, 
the separate buffers are referred to as the video buffer 
and the audio buffer, with specific examples being shown 
in as the video buffer 4b and the audio buffer 4d in 
5 Fig. 19. Uninterrupted buffer control refers to input 
control for the buffer that ensures that overflow or 
underflow do not occur for either input buffer. This is 
described in more detail later, but is fundamentally 
achieved by assigning time stamps (showing the correct 

10 times for the input, output, and display of data) that 
are standardized for an MPEG stream to the pack header 
and packet header shown in Fig. 6D and Fig. 6E . If no 
underflows or overflows occur for the video buffer and 
audio buffer, no interruptions will occur in the 

15 reproduction of the video streams and audio streams. As 
will be clear from this specification, it is very 
important that buffer control is uninterrupted. 

There is a time limitation whereby each set of 
audio data needs transferred to the audio buffer and 

20 decoded by the presentation start time of the audio 
frame to be reproduced by such data, but since audio 
streams are encoded using fixed-length encoding with a 
relatively small amount of data, the data that is 
required for the reproduction of each audio frame can be 

25 stored in audio packs. These audio packs are 
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transferred to the audio buffer during reproduction, 
meaning that the time limitation described above can be 
easily managed. 

Fig. 7A is a figure showing the ideal buffer 
operation for the audio buffer. This figure shows how 
the buffer occupancy changes for a sequence of audio 
frames. In this specification, the term "buffer 
occupancy" refers to the extent to which the capacity of 
a buffer is being used to store data. The vertical axis 
of Fig. 7A shows the occupancy of the audio buffer, 
while the horizontal axis represents time. This time 
axis is split into 32msec sections, which matches the 
reproduction period of each audio frame in the Dolby-AC3 
method. By referring to this graph, it can be seen that 
the occupancy of the buffer changes over time to 
exhibits a sawtooth pattern. 

The height of each triangular tooth that composes 
the sawtooth pattern represents the amount of data in 
the part of the audio stream to be reproduced in each 
audio frame . 

The gradient of each triangular tooth represents 
the transfer rate of the audio stream. This transfer 
rate is the same for all audio frames. 

During the period corresponding to one triangular 
tooth, audio data is accumulated with a constant 
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transfer rate during the display period (32msec) of the 
audio frame preceding the audio frame that is reproduced 
by this audio data. At the presentation end time of the 
preceding audio frame (this time representing the decode 
5 time for the present frame) , the audio data for the 
present frame is instantly outputted from the audio 
buffer. The reason a sawtooth pattern is achieved is 
that the processing from the storage in the buffer to 
output from the buffer is continually repeated. 

10 As one example, assume that transfer of an audio 

stream to the audio buffer begins at time Tl. This 
audio data should be reproduced at time T2, so that 
amount of data stored in the audio buffer will gradually 
increase between time Tl to time T2 due to the transfer 

15 of this audio data. However, because this transferred 

audio data is output at the presentation end time of the 
preceding audio frame, the audio buffer will be cleared 
of audio data at that point, so that the occupancy of 
the audio buffer returns to 0 . In Fig. 7A, the same 

20 pattern is repeated between time T2 and time T3, between 
time T3 and time T4, and so on. 

The buffer operation shown in Fig. 7A is the 
ideal buffer operation state for the premise where the 
audio data to be reproduced in each audio frame is 

25 stored in one audio pack. In reality, however, it is 
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normal for audio data that will be reproduced in several 
different audio frames to be stored in one audio pack, 
as shown in Fig. 5B. Fig. 7B shows a more realistic 
operation for the audio buffer. In this figure, audio 
5 pack A31 stores audio data A21, A22, and A23 which 

should respectively be decoded by the presentation end 
times of audio frame £21, f22, and f23. As shown in 
Fig. 7B, only the decoding of audio data A21 will be 
completed at the presentation end time of audio frame 

10 f21, with the decoding of the other sets of audio data 
f22 and f23 being respectively completed by the 
presentation end times of the following audio frames f22 
and f23. Of the audio frames included in this audio 
pack, audio data A21 should be decoded first, with the 

15 decoding of this audio data needing to be completed by 
the presentation end time of audio frame f21. 
Accordingly, this audio pack should be read from the 
DVD-RAM during the reproduction period of the audio 
frame f21. 

20 Video streams are encoded with variable code 

length due to the large differences in code size between 
the different types of pictures (I pictures, P pictures, 
and B pictures) used in compression methods that use 
time-related correlation. Video streams also include a 

25 significant amount of data, so that it is difficult to 
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complete the transfer of the picture data for a video 
frame, especially the picture data for an I picture, by 
the presentation end time of the preceding video frame. 

Fig. 7C is a graph showing video frames and the 
5 occupancy of the video buffer. In Fig. 7C, the vertical 
axis represents the occupancy in the video buffer, while 
the horizontal axis represents time. This horizontal 
axis is split into 33msec sections which each match the 
reproduction period of a video frame under NTSC 

10 standard. By referring to this graph, it can be seen 
that the changes in the occupancy of the video buffer 
changes over time to exhibit a sawtooth pattern. 

The height of each triangular tooth that composes 
the sawtooth pattern represents the amount of data in 

15 the part of the video stream to be reproduced in each 

video frame. As mentioned before, the amount of data in 
each video frame is not equal, since the amount of code 
for each video frame is dynamically assigned according 
to the complexity of the frame. 

20 The gradient of each triangular tooth shows the 

transfer rate of the video stream. The approximate 
transfer rate of the video stream is calculated by 
subtracting the output rate of the audio stream from the 
output rate of the track buffer. This transfer rate is 

25 the same during each frame period. 
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During the period corresponding to one triangular 
tooth in Fig. 7C, picture data is accumulated with a 
constant transfer rate in during the display period 
(33msec) of the video frame preceding the video frame 
5 that is reproduced by this picture data. At the 

presentation end time of the preceding video frame (this 
time representing the decode time for the present 
picture data) , the picture data for the present frame is 
instantly outputted from the video buffer. The reason a 

10 sawtooth pattern is achieved is that the processing from 
the storage in the video buffer to output from the video 
buffer is continually repeated. 

When the image to be displayed in a given video 
frame is complex, a larger amount of code needs to be 

15 assigned to this frame. When a larger amount of code is 
assigned, this means that the pre-storage of data in the 
video buffer needs to be commenced well in advance. 

Normally, the period from the transfer start 
time, at which the transfer of picture data into the 

20 video buffer is commenced, to the decode time for the 
picture data is called the VBV (Video Buffer Verify) 
delay. In general, the more complex the image, the 
larger the amount of assigned code and the longer the 
VBV delay. 

25 As can be seen from Fig. 7C, the transfer of the 
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picture data that is decoded at the presentation end 
time T16 of the preceding video frame is commenced at 
time Til. The transfer of picture data that is decoded 
at the presentation end time T18 of the preceding video 
5 frame, meanwhile, is commenced at time T12. The 

transfer of the picture data for other video frames can 
be seen to be commenced at times T14, T15, T17, T19, 
T20, and T21. 

Fig. 7D shows the transfer of sets of picture 

10 data in more detail. When considering the situation in 
Fig. 7C, the transfer of the picture data to be decoded 
at time T24 in Fig. 7D needs to be completed in the 
"Tf_Period" between the start time T23 of the "VBV 
delay" and the start of the transfer of the picture data 

15 for the next video frame to be reproduced. The increase 
in the occupancy of the buffer that occurs from this 
Tf_Period onwards is caused by the transfer of the 
picture data for the image to be displayed in the next 
video frame. 

20 The picture data accumulated in the video buffer 

waits for the time T24 at which the picture data is to 
be decoded. At the decode time T24, the image A is 
decoded, which clears part of the picture data stored in 
the video buffer, thereby reducing the total occupancy 

25 of the video buffer. 
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When considering the above situation, it can be 
seen that while it is sufficient for the transfer of 
audio data to be reproduced in a certain audio frame to 
be commenced around one frame in advance, the transfer 
5 of picture data for a certain video frame needs to be 
commenced well before the decode time of such picture 
data. In other words, the audio data which should be 
reproduced in a certain audio frame should be inputted 
into the audio buffer at around the same time as picture 

10 data for a video frame that is well in advance of the 

audio frame. This means that when the audio stream and 
video stream are multiplexed into an MPEG stream, audio 
data needs to multiplexed well before the corresponding 
picture data. As a result, the video data and audio 

15 data in a VOBU are in fact composed of video data that 
will be reproduced later and audio data. 

The arrangement of the plurality of video packs 
and audio packs has been described as reflecting the 
transfer order of the data included in the packs. 

20 Accordingly, to have the audio data to be reproduced in 
an audio frame read at approximately the same time as 
the picture data to be reproduced in a video frame that 
is well ahead of the audio frame, the audio packs and 
video packs that store the audio data and picture data 

25 in question need to be arranged into a same part of the 
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VOB. 

Fig. 8A shows how the audio packs, which store 
audio data to be reproduced in each audio frame, and the 
video packs, which show the picture data to be 
5 reproduced in each video frame, should be stored. 

In Fig. 8A, the rectangles marked with "V" and 
"A" show each video pack and audio pack. Fig. 8B shows 
the meaning of the width and height of each of these 
rectangles. As shown in Fig. 8B, the height of each 
10 rectangle shows the bitrate used to transfer the pack. 
As a result, packs that have a tall height are 
transferred with a high bitrate, which means that the 
pack can be inputted into a buffer relatively quickly. 
Packs that are not tall, however, are transferred with a 
15 low bitrate, and so take a relatively long time to be 
transferred into the buffer. 

The picture data Vll that is decoded at time Til 
in Fig. 8B is transferred during the period kll. Since 
the transfer and decoding of the audio data All are 
20 performed during this period kll, the video packs that 
store the video data Vll and the audio pack that stores 
the audio data All are arranged into a similar position, 
as shown in the lower part of Fig. 8A. 

The picture data V12 that is decoded at time T12 
25 in Fig. 8A is transferred during the period kl2 . Since 



WO 99/14935 



PCT/JP98/04174 



the transfer and decoding of the audio data A12 are 
performed during this period kl2, the video packs that 
store the video data V12 and the audio pack that stores 
the audio data A12 are arranged into a similar position, 
5 as shown in the lower part of Fig. 8A. 

In the same way, the audio data A13, A14, and A15 
are arranged into similar positions as the picture data 
V13 and V14 whose transfer is commenced at the output 
time of these sets of audio data. 

10 Note that when picture data with a large amount 

of assigned code, such as picture data V16, is 
accumulated in the buffer, a plurality of audio data 
A15, A16, and A17 are multiplexed during kl6 which is 
the transfer period of the picture data VI 6. 

15 Fig. 9 shows how audio packs that store a 

plurality of sets of audio data to be reproduced in a 
plurality of audio frames and video packs that store 
picture data to be reproduced in each video frame may be 
stored. In Fig. 9, audio pack A31 stores the audio data 

20 A21, A22, and A23 that is to be reproduced for audio 
frames f21, f22, and f23. Of the audio data that is 
stored in the audio pack A31, the first audio data to be 
decoded is the audio data A21. Since the audio data A21 
needs to be decoded at the presentation end time of the 

25 audio frame f20, this audio data A21 needs to be read 
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from the DVD-RAM together with the picture data Vll 
whose transfer is performed during the same period 
(period kll) as the audio frame f20. As a result, the 
audio pack A31 is arranged near the video packs that 
5 store the picture data Vll. 

When considering that an audio pack can store 
audio data which should be decoded for several audio 
frames, and that audio packs are arranged in similar 
positions to video packs that are composed of picture 

10 data which should be decoded in the future, it may seem 
that the audio data and picture data to be decoded at 
the same time should be stored in audio packs and video 
packs that are at distant positions within a VOB. 
However, there will be no cases where video packs which 

15 store picture data that will be decoded one second or 
more later are stored alongside audio data that should 
be decoded at the same time. This is because MPEG 
standard defines the upper limit for the time data can 
be accumulated in the buffer, with all data having to be 

20 outputted from the buffer within one second of being 
inputted into the buffer. This restriction is called 
the "one-second rule" for MPEG standard. Because of the 
one-second rule, even if audio data and picture data 
that are to be decoded at the same time are arranged 

25 into distant positions, the audio pack that stores the 
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audio data to be decoded at a given time will definitely 
be stored within a range of 3 VOBUs from the VOBU that 
stores the picture data to be decoded at the same given 
time . 

5 

(1-2-3-2-2) Buffer Control Between VQBs 

The following explanation deals with the buffer 
control that is performed when reproducing two or more 
VOBs successively. Fig. 10A shows the buffer state 

10 for the first part of a video stream. In Fig. 10A, the 
input of the pack that includes the picture data is 
commenced at the point indicated as First_SCR during the 
video frame f71, with the amount of data shown as BT2 
being transferred by the presentation end time of the 

15 video frame f72. Similarly, the amount of data BT3 has 
been accumulated in the buffer by the presentation end 
time of the video frame f73. This data is read from the 
video buffer by the video decoder at the presentation 
end time of the video frame f74, with this time being 

20 indicated hereafter by the notation FirstJDTS. In this 
way, the state of the buffer changes as shown in Fig. 
10A, with no data for a preceding video stream at the 
start and the accumulated amount of data gradually 
increasing to trace a triangular shape. Note here that 

25 Fig. 10A is drawn with the premise that the video pack 
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is inputted at the time First_SCR, although when the 
pack positioned at the front of a VOB is a different 
pack, the start of the increased in the amount of 
buffered data will not match the time First_SCR. Also, 
5 the reason Last_SCR is positioned midway through a video 
frame is that the data structure of the pack is 
unrelated to the data structure of the video data. 

Fig. 10B shows the buffer state during the latter 
part of a video stream. In this drawing, the input of 

10 data into the video buffer is completed at the time 

Last_SCR that is located midway through video frame f61. 
After this, only the data amount A3 of the accumulated 
video data is taken from video buffer at the 
presentation end time of video frame f61. Following 

15 this, it can be seen that only the data amount A4 is 

taken from video buffer at the presentation end time of 
video frame f62, and only the data amount A 5 is taken 
at the presentation end time of video frame f63, this 
latter time also being called the Last_DTS. 

20 For the latter part of a VOB, the input of video 

packs and audio packs is completed by the time shown as 
Last_SCR in Fig. 10B, so that the amount of data stored 
in the video buffer will thereafter decrease in steps at 
the decoding of video frames f61, f62, f63 and f 64 . As a 

25 result, the occupancy of the buffer decreases in steps 
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at the end of a video stream, as shown in Fig. 10B. 

Fig. IOC shows the buffer state across VOBs . In 
more detail, this drawing shows the case where the 
latter part of a video stream that causes the buffer 
5 state shown in Fig. 10B is seamlessly linked to the 
former part of another video stream that causes the 
buffer state shown in Fig. 10A. 

When these two video streams are seamlessly 
linked, the First_DTS of the former part of the second 

10 video stream to be reproduced needs to follow after the 
video frame with the Last_DTS of the latter part of the 
first video stream. In other words, the decoding of the 
first video frame in the second video stream needs to be 
performed after the decoding of the video frame with the 

15 final decode time in the first video stream. If the 

interval between the Last_DTS of the latter part of the 
first video stream and the First_DTS of the former part 
of the second video stream is equivalent to one video 
frame, the picture data of the latter part of the first 

20 video stream will coexist in the video buffer with the 
picture data of the former part of the second video 
stream, as shown in Fig. 10C. 

In Fig. 10C, it is assumed that the video frames 
f71, f72, and f73 shown in Fig. 10A match the video 

25 frames f61, f62, and f63 shown in Fig. 10B. In such 
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conditions, at the presentation end time of video frame 
f71, the picture data BEl of the latter part of the 
first video stream and the picture data BT1 of the 
former part of the second video stream are present in 
5 the video buffer. At the presentation end time of the 

video frame f72, the picture data BE2 of the latter part 
of the first video stream and the picture data BT2 of 
the former part of the second video stream are present 
in the video buffer. At the presentation end time of 

10 the video frame £13, the picture data BE 3 of the latter 
part of the first video stream and the picture data BT3 
of the former part of the second video stream are 
present in the video buffer. As the decoding of video 
frames progresses, the picture data of the latter part 

15 of the first video stream decreases in steps, while the 
picture data of the former part of the second video 
stream gradually increases. These decreases and 
increases occur concurrently, so that the buffer state 
shown in Fig. 10C exhibits a sawtooth pattern which 

20 closely resembles the buffer state shown for VOBs in 
Fig. 7C. 

It should be noted here that each of total 
BTl+BEl of the data amount BT1 and the data amount BEl, 
total BT2+BE2 of the data amount BT2 and the data amount 
25 BE2, and total BT3+BE3 of the data amount BT3 and the 
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data amount BE3 is below the capacity of the video 
buffer. Here, if any of these totals BT1+BE1, BT2+BE2 
or BT3+BE3 exceeds the capacity of the video buffer, an 
overflow will occur in the video buffer. If the highest 
5 of these totals is expressed as Bvl+Bv2, this value 

Bvl+Bv2 must be within the capacity of the video buffer. 

(1-2-3-3) Pack Header, S ystem Header, Packet H^riPr 

The information for the buffer control described 

10 above is written as time stamps in the pack header, the 
system header, and the packet header shown in Figs. 
6F-6H. Figs. 6F-6H show the logical formats of the pack 
header, the system header, and the packet header. As 
shown in Fig. 6F, the pack header includes a 

15 Pack_Start_Code, an SCR (System Clock Reference) showing 
the time at which the data stored in the present pack 
should be inputted into the video buffer and audio 
buffer, and a Program_max_rate . In a VOB, the first SCR 
is set as the initial value of the STC (System Time 

20 Clock) that is provided as a standard feature in a 
decoder under MPEG standard. 

The system header shown in Fig. 6G is only 
appended to the video pack that is located at the start 
of a VOBU. This system header includes maximum rate 

25 information (shown as the "Rate .bound . info" in Fig. 6G) 
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showing the transfer rate to be requested of the 
reproduction apparatus when inputting the data, and 
buffer size information (shown as "Buffer -bound, info" in 
Fig. 6G) showing the highest buffer size to be requested 
5 of the reproduction apparatus when inputting the data in 
the VOBU. 

The packet header shown in Fig. 6H includes a DTS 
(Decoding Time Stamp) showing the decoding time and, for 
a video stream, a PTS (Presentation Time Stamp) shown 

10 the time at which data should be outputted after 

reordering the decoded video stream. The PTS and DTS 
are set based on the presentation start time of a video 
frame or audio frame. In the data construction, a PTS 
and a DTS can be set for all packs, although it is rare 

15 for such information for picture data that should be 
displayed for all the video frames. It is common for 
such information to be assigned once in a GOP, which is 
to say once every 0.5 seconds of reproduction time. 
Every video pack and audio pack is assigned an SCR, 

20 however. 

For a video stream, it is common for a PTS to be 
assigned to each video frame in a GOP, though for an 
audio stream, it is common for a PTS to be assigned 
every one or two audio frames. For an audio stream, 

25 there will be no difference between the display order 
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and the coding order, so that no DTS is required. When 
one audio pack stores all of the audio data that is to 
be reproduced for two or more audio frames, a PTS is 
written at the start of the audio pack. 
5 As one example, the audio pack A71 shown in Fig. 

5B may be given the presentation start time of the audio 
frame f81 as the PTS. On the other hand, the audio pack 
A72 that stores the divided audio frame f83 must be 
given the presentation start time of the audio frame 
10 f84, not the presentation start time of the audio frame 
f83, as the PTS. This is also the case for the audio 
pack A73, which must be given the presentation start 
time of the audio frame f86, not the presentation start 
time of the audio frame f85, as the PTS. 

15 

(1-2-3-4) Continuity of Time Stamps 

The following is an explanation of the values 
that are set as the PTS, DTS, and SCR for video packs 
and audio packs, as shown in Figs. 6F to 6H. 

20 Fig. 11A is a graph showing the values of the SCR 

of packs included in a VOB in the order that packs are 
arranged in the VOB. The horizontal axis shows the 
order of the video packs, with the vertical axis shows 
the value of the SCR which is assigned to each pack. 

25 The first value of the SCR in Fig. 11A is not 
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zero, and is instead a predetermined value shown as 
Initl. The reason the first value of the SCR is not 
zero is that the VOBs that are processed by a video 
editing apparatus are subjected to many editing 
5 operations, so that there are many cases where the first 
part of a VOB will have already been deleted. It should 
be obvious that the initial value of the SCR of a VOB 
that has just been encoded will be zero, although the 
present embodiment assumes that the initial value of the 

10 SCR for a VOB is not zero, as shown in Fig. 11A. 

In Fig. 11A, the closer a video pack is to the 
start of the VOB, the lower the value of the SCR of that 
video pack, and the further a video pack is from the 
start of the VOB, the higher the value of the SCR of 

15 that video pack. This characteristic is referred to as 
the "continuity of time stamps", with the same 
continuity being exhibited by the DTS. Though the 
coding order of video packs is such that a latter video 
pack may in fact be displayed before a former video 

20 pack, meaning that the PTS of the latter pack has a 
lower value than the former pack, the PTS will still 
exhibit a rough continuity in the same way as the SCR 
and the DTS . 

The SCR of audio packs exhibits continuity in the 
25 same way as for video packs. 
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The continuity of the SCR, DTS, and PTS is a 
prerequisite for the proper decoding of VOBs . The 
following is an explanation of the values used for SCR 
to maintain this continuity. 
5 In Fig. 11B, the straight line showing the values 

of SCR in the section B is an extension of the straight 
line showing the values of SCR in the section A. This 
means that there is continuity between the values of SCR 
between section A and section B. 

10 In Fig. 11C, the first value of SCR in the period 

D is higher than the largest value on the straight line 
showing the values of SCR in the section C. However, in 
this case also, the closer a pack is to the start of the 
VOB, the lower the value of SCR, and the further a video 

15 pack is from the start of the VOB, the higher the value 
of SCR. This means that there is continuity of the time 
stamps between section C and section D. 

Here, when the difference in time stamps is 
large, these stamps are naturally non-continuous. Under 

20 MPEG standard, the difference between pairs of time 

stamps, such as SCRs, must not exceed 0.7 seconds, so 
that areas in the data where this value is exceeded are 
treated as being non-continuous. 

In Fig. 11D, the last value of SCR in section E 

25 is higher than the first value on the straight line 
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showing the values of SCR in section F. In this case, 
the continuity wherein the closer a pack is to the start 
of the VOB, the lower the value of SCR, and the further 
a video pack is from the start of the VOB, the higher 
5 the value of SCR is no longer valid, so that there is no 
continuity in the time stamps between section E and 
section F. 

When there is no continuity in the time stamps, 
as the example of section E and section F, the former 
10 and latter sections are managed as separate VOBs. 

It should be noted that the details of buffer 
control between VOBs and the multiplexing method are 
described in detail in the PCT publications "WO 
97/13367" and "WO 97/13363". 

15 

(1-2-4) AV Files 

An AV file is a file that records at least one 
VOB that is to be reproduced consecutively. When a 
plurality of VOBs are held within one AV file, these 

20 VOBs are reproduced in the order they are stored in the 
AV file. For the example in Fig. 4, the three VOBs, 
VOB#l, VOB#2, and VOB#3, are stored in one AV file, with 
these VOBs being reproduced in the order VOB#l -> VOB#2 
-+ VOB#3. When VOBs are stored in this way, the buffer 

25 state for the video stream positioned at the end of the 
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first VOB to be reproduced and the video stream 
positioned at the start of the next VOB to be reproduced 
will be as shown in Fig. IOC. Here, if the highest 
amount data Bvl+Bv2 to be stored in the buffer exceeds 
5 the capacity of the buffer, or if the first time stamp 
in the VOB to be reproduced second is not continuous 
with the last time stamp in the VOB to be reproduced 
first, there is the danger that seamless reproduction 
will not be possible for the first and second VOBs . 

10 

(1-3) Logical Construction of the RTRW Management File 

The following is an explanation of the RTRW 
management file. The RTRW management file is 
information showing attributes for each VOB stored in an 

15 AV file. 

Fig. 12A shows the detailed hierarchical 
structure in which data is stored in the RTRW management 
file. The logical format shown on the right of Fig. 12A 
is a detailed expansion of the data shown on the left, 

20 with the broken lines serving as guidelines to clarify 
which parts of the data structure are being expanded. 

By referring to the data structure in Fig. 12B, 
it can be seen that the RTRW management file records VOB 
information for VOB#l, VOB#2, VOB#3, ... VOB#6, and that 

25 this VOB information is composed of VOB general 
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information, stream attribute information, a time map 
table, and seamless linking information. 



5 (1-3-1) VOB General Information 

The "VOB general information" refers to the VOB- 
ID that is uniquely assigned to each VOB in an AV file 
and to the VOB reproduction period information of each 
VOB. 

10 

(1-3-2) Stream Attribute Information 

The stream attribute information is composed of 
video attribute information and audio attribute 
information . 

15 The video attribute information includes video 

format information that indicates one of MPEG2 and 
MPEG1, and a display method that indicates one of NTSC 
and PAL/SECAM. When the video attribute information 
indicates NTSC, an indication such as "720x480" or 

20 "352x240" may be given as the display resolution, and an 
indication such as "4:3" or "16:9" may be given as the 
aspect ratio. The presence/absence of copy prevention 
control for an analog video signal may also be 
indicated, as may the presence/absence of a copy guard 

25 for a video cassette recorder which damages the AGC 
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circuit of a VTR by changing the signal amplitude during 
the blank period of a video signal. 

The audio attribute information shows the 
encoding method which may be one of MPEG2, Dolby 
5 Digital, or Linear-PCM, the sampling frequency (such as 
48kHz) , a bitrate when a fixed bitrate is used, or a 
bitrate marked with "VBR" when a variable bitrate is 
used. 

The time map table shows the size of each VOBU 
10 that composes the VOB and the reproduction period of 
each VOBU. To improve accessing capabilities, 
representative VOBUs are selected at a predetermined 
interval, such as a multiple of ten seconds, and the 
addresses and reproduction times of these representative 
15 VOBUs are given relative to the start of the VOB. 

(1-3-3) Seamless Linking Information 

The seamless linking information is information 
which enables the consecutive reproduction of the 
20 plurality of VOBs in the AV file to be performed 

seamlessly. This seamless linking information includes 
the seamless flag, the video presentation start time 
VOB__V__S_PTM, the video presentation end time 
VOB_V_E_PTM, the First_SCR, the Last_SCR, the audio gap 
25 start time A_STP_PTM, the audio gap length A_J3AP_LEN, 
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and the audio gap location information A_GAP_LOC. 

(1-3-3-1) Seamle ss Flag 

The seamless flag is a flag showing whether the 
5 VOB corresponding the present seamless linking 

information is reproduced seamlessly following the end 
of reproduction of the VOB positioned immediately before 
the present VOB in the AV file. When this flag is set 
at "01", the reproduction of the present VOB (the latter 
10 VOB) is performed seamlessly, while when the flag is set 
at "00", the reproduction of the present VOB is not 
produced seamlessly. 

In order to perform the reproduction of a 
plurality of VOBs seamlessly, the relationship between 
15 the former VOB and the latter VOB must be as follows. 

(1) Both VOBs must use the same display method 
(NTSC, PAL, etc.) for the video stream as given in the 
video attribute information. 

(2) Both VOBs must use the same encoding method 
20 (AC-3, MPEG, Linear-PCM) for the audio stream as given 

in the audio attribute information. 

Failure to comply with the above conditions 
prevents seamless reproduction from being performed. 
When a different display method is used for a video 
25 stream or a different encoding method is used for an 
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audio stream,, the video encoder and audio encoder will 
have to stop their respective operations to switch the 
display method, decoding method, and/or bit rate. 

As one example, when two audio streams that are 
5 to be reproduced consecutively are such that the former 
audio stream has been encoded according to AC-3 methods 
and the latter according to MPEG methods, an audio 
decoder will have to stop decoding to switch the stream 
attributes when the stream switches from AC-3 to MPEG. 

10 A similar situation also occurs for a video decoder when 
the video stream changes. 

The seamless flag is only set to "01" when both 
of the above conditions (1) and (2) are satisfied. If 
any one of the above conditions (1) and (2) is not 

15 satisfied, the seamless flag is set at "00". 



(1-3-3-2) Video Presenta tion Start Time VOB_V_S_ptm 

The video presentation start time VOB_V_S__PTM 
shows the time at which reproduction of the first video 
20 field in the video streams composing a VOB is to start. 
This time is given in PTM descriptor format. 

PTM descriptor format is a format whereby the 
time is expressed with an accuracy of 1/27,000,000 
seconds or 1/90,000 seconds (=300/27,000,000 seconds). 
25 This accuracy of 1/90,000 seconds is set considering the 
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common multiples of the frame frequencies of NTSC 
signals, PAL signals, Dolby AC- 3, and MPEG Audio, while 
the accuracy of 1/27,000,000 seconds is set considering 
the frequency of the STC. 
5 Fig. 12B shows the PTM descriptor format. In 

this drawing, the PTM descriptor format is composed of a 
base element (PTM_base) that shows the quotient when the 
presentation start time is divided by 1/90,000 seconds 
and an extension element ( PTM_extension) that shows the 
10 remainder when the same presentation start time is 
divided by the base element to an accuracy of 
1/27,000,000 seconds. 

(1-3-3-3) Video Presentation End Time VOB_V_E_PTM 
15 The video presentation end time VOB_V_E_PTM shows 

the time at which reproduction of the last video field 
in the video streams composing a VOB ends. This time is 
also given in PTM descriptor format. 

20 (1-3-3-4) Relation between Video Presentation Start Timfi 
VOB_V_S_PTM and Video Presentation End Time VOB_V_E_PTM 
The following is an explanation of the relation 
between the VOB__V_E_PTM of a former VOB and the 
V0B_V_S_PTM of a latter VOB, when the former VOB and 

25 latter VOB are to be seamlessly reproduced. 
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Since the latter VOB is fundamentally to be 
reproduced after all of the video packs included in the 
former VOB, so that if the VOB_V_S_PTM of the latter VOB 
is not equal to the VOB_V_E__PTM of the former VOB, the 
5 time stamps will not be continuous, meaning that the 
former VOB and latter VOB cannot be reproduced 
seamlessly. However, when the two VOBs have been 
encoded completely separately, the encoder will have 
assigned a unique time stamp to each video pack and 

10 audio pack during encoding, so that the condition for 
the VOB_V_S_PTM of the latter VOB to be equal to the 
VOB_V_E_PTM of the former VOB becomes problematic. 

Fig. 13 shows the state of the buffer for the 
former VOB and the latter VOB. In the graphs in Fig. 

15 13, the vertical axis shows the occupancy of the buffer 
while the horizontal axis represents time. The times 
representing the SCR, PTS, video presentation end time 
VOB__V_E_PTM, and video presentation start time 
VOB_V_S__PTM have been plotted. In Fig. 11B, the picture 

20 data that is reproduced last in the former VOB is 

inputted into the video buffer by the time indicated as 
Last_SCR of the video pack composed by this picture 
data, with the reproduction processing of this data 
waiting until the PTS that is the presentation start 

25 time is reached (if the last pack inputted into an MPEG 
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decoder is an audio or other pack, this condition is not 
valid) . Here, video_presentation_end_time VOB_V__E__PTM 
shows the point where the display period hi of this 
final video has expired starting from this PTS . This 
display period hi is the period taken to draw an image 
from the first field that composes one screen-sized 
image to the final field. 

In the lower part of Fig. 11B, the picture data 
that should be displayed first in the latter VOB is 
inputted into the video buffer at the time First_SCR, 
with the reproduction of this data waiting until the PTS 
indicating the presentation start time. In this 
drawing, the video packs of the former and latter VOBs 
are respectively assigned an SCR with the first value 
"0", a video presentation end time VOB_V_E_PTM, and a 
video presentation start time VOB_V_S__PTM . For this 
example, it can be seen that VOB_V_S_PTM of latter VOB < 
VOB_V_E_PTM of former VOB. 

The following is an explanation of why seamless 
reproduction is possible even for the condition 
VOB_V_S_PTM of latter VOB < VOB_V_E_PTM of former VOB. 
Under DVD-RAM standard, an extended STD model 
(hereinafter "E-STD") is defined as the standard model 
for the reproduction apparatus, as shown in Fig. 19. In 
general, an MPEG decoder has an STC (System Time Clock) 
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for measuring a standard time, with the video decoder 
and audio decoder referring to the standard time shown 
by the STC to perform decode processing and reproduction 
processing. In addition to the STC, however, E-STD has 
5 an adder for adding an offset to the standard time 

outputted by the STC, so that either of the standard 
time outputted by the STC and the addition result of the 
adder may be selected and outputted to the video decoder 
and the audio decoder. With this construction, even if 

10 the time stamps for different VOBs are not continuous, 
the output of the adder may be supplied to the decoder 
to have the decoder behave as if the time stamps of the 
VOBs were continuous. As a result, seamless 
reproduction is still possible even when the VOB_V_E_PTM 

15 of former VOB and the VOB_V_S_PTM of latter VOB are not 
continuous, as in the above example. 

The difference between the VOB_V_S_PTM of latter 
VOB and the VOB_V_E_PTM of former VOB can be used as the 
offset to be added by the adder. This is normally 

20 referred to as the "STC_of f set " . As a result, a 

reproduction apparatus of the E-STD model finds the 
STC_offset according to the formula shown below which 
uses the VOB_V_S_PTM of latter VOB and the VOB_V_E_PTM 
of former VOB. After finding the STC_offset, the 

25 reproduction apparatus then sets the result in the 
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adder . 

STC_offset = VOB_V_E_PTM of former VOB - 
VOB_V_S_PTM of latter VOB 

5 

The reason the VOB_V_SJPTM of latter VOB and the 
VOB__V_E_PTM of former VOB are written in the seamless 
linking information is to enable the decoder to perform 
the above calculation and set the STC_offset in the 
10 adder. 

Fig. HE is a graph that has been plotted for two 
VOBs in each of which the time stamps are continuous, as 
shown in Fig. HA. The time stamp of the first pack in 
VOB#l includes the initial value Initl, with the packs 

15 following thereafter having increasingly higher values 
as their time stamps. In the same way, the time stamp 
of the first pack in VOB#2 includes the initial value 
Init2, with the packs following thereafter having 
increasingly higher values as their time stamps. In 

20 Fig. HE, the final value of the time stamps in VOB#l is 
higher than the first value of the time stamps in VOB#2, 
so that it can be seen that the time stamps are not 
continuous across the two VOBs. When the decoding of 
the first pack in VOB#2 is desired following the final 

25 pack of VOB#l regardless of the non-continuity of time 
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stamps, an STC_offset can be added to the time stamps in 
V0B#2, thereby shifting the time stamps in V0B#2 from 
the solid line shown in Fig. HE to the broken line that 
continues as an extension of the time stamps in V0B#1. 
5 As a result, the shifted time stamps in V0B#2 can be 
seen to be continuous with the time stamps in V0B#1. 



(1-3-3-5) First_SCR 

The First_SCR shows the SCR of the first pack in 
10 a VOB, written in PTM descriptor format. 

(1-3-3-6) Last_SCR 

The Last_SCR shows the SCR of the last pack in a 
VOB, written in PTM descriptor format. 

15 

(1-3-3-7) Relati onship bet ween the First_SCR ^nd 
Last_SCR 

As described above, since the reproduction of VOB 
is performed by a decoder of E-STD type, the Last_SCR of 
20 the former VOB and the First_SCR of the latter VOB do 
not need to satisfy the condition that Last_SCR of 
former VOB = First_SCR of latter VOB. However when 
using an STC_offset, the following relationship must be 
satisfied . 

25 
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Last_SCR of former VOB + time required by 1 pack 
transfer ^ STC_offset + First_SCR of latter VOB 

Here, if the Last_SCR of former VOB and the 
5 First_SCR of latter VOB do not satisfy the above 

equation, this means that the packs that compose the 
former VOB are transferred into the video buffer and 
audio buffer at the same time as the packs that compose 
the latter VOB. This violates MPEG standard and the 

10 decoder model of E-STD where packs are transferred one 
at a time in the order of the pack sequence. By 
referring to Fig. 10C, it can be seen that the Last_SCR 
of former VOB matches the FirstJSCR of latter 
VOB+STC_of f set, so that the above relationship is 

15 satisfied. 

When VOB is reproduced using decoder of E-STD 
type, of particular note is the time at which switching 
is performed between outputting the standard time 
outputted by the STC and outputting the standard time 

20 with the offset added by the adder. Since no 

information for this switching is given in the time 
stamps of a VOB, there is the risk that the improper 
timing will be used for switching to the output value of 
the adder . 

25 First_SCR and Last_SCR are effective for 
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informing the decoder of the correct timing to switch to 
the output value of the adder. While the STC is 
counting, the decoder compares the standard time 
outputted by the STC with the First_SCR and Last__SCR. 
5 When the standard time outputted by the STC matches the 
First__SCR or Last__SCR, the decode switches from the 
standard time outputted by the STC to the output value 
of the adder . 

When reproducing a VOB, standard reproduction 

10 reproduces the latter VOB after reproducing the former 
VOB, while "rewind reproduction" (backward picture 
search) reproduces the former VOB after the latter VOB. 
Accordingly, the LastJSCR is used for switching the 
value used by the decoder during standard reproduction, 

15 and First_SCR is used for switching the value used by 
the decoder during rewind reproduction. During rewind 
reproduction, the latter VOB is decoded starting from 
the last VOBU to the first VOBU, and when the first 
video pack in the latter VOB has been decoded, the 

20 former VOB is decoded starting from the last VOBU to the 
first VOBU. In other words, during rewind reproduction, 
the time at which the decoding of the first video pack 
in the latter VOB is complete is the time at which the 
value used by the decoder needs to be switched. To 

25 inform a video data editing apparatus of E-STD type of 
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this time, the First_SCR of each VOB is provided in the 
RTRW management file. 

A more detailed explanation of the techniques 
used for E-STD and the STC_offset is given in the PCT 
Publication W097/13364. 

(1-3-3-8) Audio Gap Start Time "A_STP_PTM" 

When an audio reproduction gap exists in a VOB, 
the audio gap start time "A_STP_PTM" shows the halt 
start time at which the audio decoder should halt its 
operation. This audio gap start time is given in PTM 
descriptor format. One audio gap start time A_STP_PTM 
is indicated for one VOB. 

(1-3-3-9) Audio Gap Length "A_GAP_LEN" 

The audio gap length A_GAP__LEN " shows how long 
the audio decoder should stop its operation starting 
from the halt start time indicated as the audio gap 
start time "A_STP__PTM" . The length of this audio gap 
length A_GAP_LEN is restricted to being less than the 
length of one audio frame. 

(1-3-3-10) Inevitability of Audio Gap 

The following is an explanation of why a period 
where an audio gap occurs needs to be specified by the 

82 



WO 99/14935 



PCT/JP98/04174 



audio gap start time A_STP__PTM and audio gap length 
A__GAP_LEN . 

Since video streams and audio streams are 
reproduced with different cycles, the total reproduction 
5 time of a video stream contained in a VOB does not match 
the total reproduction time of the audio stream. For 
example, if the video stream is for NTSC standard and 
the audio stream is for Dolby-AC3, the total 
reproduction time of the video stream will be an integer 

10 multiple of 33msec and the total reproduction time of 

the audio stream will be an integer multiple of 32msec, 
as shown in Fig. 14A. 

If seamless reproduction of two VOBs is performed 
without regard to these differences in total 

15 reproduction time, it will be necessary to align the 

reproduction time of one set of the picture data and the 
reproduction time of the audio data to synchronize the 
reproduction of the picture data with the audio data. 
In order to align such reproduction times, a difference 

20 in total time appears at one of the start or the end of 
the picture data or audio data. 

In Fig. 14B, the reproduction time of the picture 
data is aligned with the reproduction time of the audio 
data at the start of a VOB, so that the time difference 

25 gl is present at the end of the picture data and audio 
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data . 

Since the time difference gl is present at the 
end of VOB#l, when seamless reproduction V0B#1 and V0B#2 
is attempted, the reproduction of the audio stream in 
5 VOB#2 is performed to fill in the time difference gl, 
meaning that the reproduction of the audio stream in 
V0B#2 starts at time gO . The audio decoder uses a fixed 
frame rate when reproducing an audio stream, so that the 
decoding of audio streams is continuously performed with 

10 a fixed cycle. When VOB#2 that is to be reproduced 

following VOB#l has already been read from the DVD- RAM, 
the audio decoder can commence the decoding of VOB#2 as 
soon as it has completed the decoding of the audio 
stream in VOB#l. 

15 To prevent the audio stream in the next VOB from 

being reproduced too early during seamless reproduction, 
the audio gap information in the stream is managed on 
the host side of a reproduction apparatus, so that 
during audio gap period, the host needs to halt the 

20 operation of the audio decoder. This reproduction halt 
period is the audio gap, and starts from the audio gap 
start time A_JSTP_PTM and continues for the period 
indicated as A_GAP_LEN. 

Processing to specify audio gaps is also 

25 performed within a stream. More specifically, the PTS 
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of an audio frame immediately after an audio gap is 
written in the packet header of an audio packet, so that 
it is possible to specify when the audio gap ends. 
However, problems arise with the specifying method when 
5 several sets of audio data that should be reproduced for 
several audio frames are stored in a single audio 
packet. In more detail, when several sets of audio data 
to be reproduced for several audio frames are stored in 
a single audio packet, it is only possible to provide a 

10 PTS for the first out of the plurality of audio frames 
in this packet. In other words, a PTS cannot be 
provided for the remaining audio frames in the packet. 
If the audio data that is to be reproduced for the audio 
frames located both before and after an audio gap is 

15 arranged into the same packet, it will not be possible 

to provide a PTS for the audio frame located immediately 
after the audio gap. As a result, it will not be 
possible to specify the audio gap, meaning that the 
audio gap will be lost. To avoid this, the audio frame 

20 located immediately after an audio gap is processed so 
as to be arranged at the front of the next audio pack, 
so that the PTS (audio gap start time A_STP_PTM and 
audio gap length A__GAP_LEN) of the audio frame 
immediately after the audio gap can be clarified within 

25 the stream. 
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Whenever necessary, a Padding-Packet , as 
prescribed by MPEG standard, may be inserted immediately 
after audio data in an audio packet that stores the 
audio data to be reproduced immediately before an audio 
5 gap. Fig. 14C shows audio pack G3 which includes an 

audio gap which includes the audio data y-2,y-l,y to be 
reproduced for the audio frames y-2,y-l,y located at the 
latter part of V0B#1 shown in Fig. 14B and a 
Padding_Packet . This drawing also shows audio pack G4 

10 that includes the audio frames u+1, u+2, and u+3 that 
are positioned at the front of VOB#2 . 

The above-mentioned audio pack G4 is the pack 
that includes the audio data that is to be reproduced 
for the audio frame immediately after the audio gap, 

15 while audio pack G3 is the pack that is located in 
immediately before this pack. 

If the audio data to be reproduced for the audio 
frame located immediately after the audio gap is 
included in a pack, the pack located immediately before 

20 such pack is called an "audio pack including an audio 
gap" . 

Here, the audio pack G3 is positioned toward the 
end of the video pack sequence in a VOBU, with no 
picture data with a later reproduction time being 
25 included in VOB#l. However, it is assumed that the 
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reproduction of VOB#2 will follow the reproduction of 
V0B#1, so that picture data included in V0B#2 is the 
picture data that should be read corresponding to audio 
frames y-2, y-1, and y. 
5 If this is the case, the audio pack G3 that 

includes the audio gap may be positioned within any of 
the first three VOBU in V0B#2 without violating the 
"one-second rule". Fig. 14D shows that this audio pack 
G3 that includes the audio gap may be positioned within 

10 any of VOBU#l, VOBU#2, and VOBU#3 at the start of VOB#2 . 

The operation of the audio decoder needs to be 
temporarily halted for the period of the audio gap. 
This is because the audio decoder will try to perform 
the decode processing even during the audio gap, so that 

15 the host control unit that performs the core control 

processing in a reproduction apparatus has to indicate 
an audio pause to the decoder once the reproduction of 
picture data and audio data has ended, thereby 
temporarily halting the audio decoder. This indication 

20 is shown as the ADPI (Audio Decoder Pause Information) 
in Fig. 19. 

By doing so, the operation of the audio decoder 
can be stopped during the period of the audio gap. 
However, this does not mean that the audio output can be 
25 stopped regardless of how an audio gap appears in the 
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data . 

This is because it is normal for the control unit 
to be composed of a standard microcomputer and software, 
so that depending on the circumstances for stopping the 
5 operation of the audio decoder, should audio gaps 

repeatedly occur during a short period of time, there is 
the possibility of the control unit not issuing the halt 
indication sufficiently early. As one example, when 
VOBs of approximately one second in length are 

10 reproduced consecutively, it becomes necessary to give a 
halt indication to the audio decoder at intervals of 
around one second. When the control unit is composed of 
a standard microcomputer and software, there is the 
possibility that the control unit will not be able to 

15 halt the audio decoder for the period where such audio 
gaps are present. 

When reproducing VOBs, the reproduction time of 
picture data and the reproduction time of audio data 
have been aligned several times, with it being necessary 

20 to provide the audio decoder with a halt indication 
every time. When the control unit is composed of a 
standard microcomputer and software, there is the 
possibility that the control unit will not be able to 
halt the audio decoder for the period where such audio 

25 gaps are present. For this reason, the following 
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restrictions are enforced so that audio gaps only occur 
once within a predetermined period. 

First, to allow the control unit to perform the 
halt operation with ease, the reproduction period of 
5 VOBs is set at 1.5 seconds or above, thereby reducing 
the frequency with audio gaps may occur. 

Second, the alignment of the reproduction time of 
picture data and the reproduction time of audio data is 
only performed once in each VOB. By doing so, there 
10 will only be one audio gap in each VOB. 

Third, the period of each audio gap is restricted 
to being less than one audio frame. 

Fourth, the audio gap start time VOB__A_STP_PTM is 
set with the video presentation start time VOB_V__S PTM 
15 of the following VOB as a standard, so that the audio 
gap start time VOB_A__STP_PTM is restricted to being 
within one audio frame of the following video 
presentation start time VOB_V_S_PTM . 

As a result, 

20 VOB_V_S_PTM - reproduction period of one audio frame 

< A STP PTM ^ VOB V S PTM 



If an audio gap that satisfies the above formula 
occurs, the first image in the following VOB will just 
25 have been displayed, so that even if there in no audio 
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output at this time, this will not be particularly 
conspicuous to the user. 

By providing the above restriction, when audio 
gaps appear during seamless reproduction, the interval 
5 between the audio gaps will be at least "1.5 seconds - 
reproduction period of two audio frames". More 
specifically, by substituting actual values, the 
reproduction period of each audio frame will be 32msec 
when Dolby AC 3 is used, so that the minimum interval 
10 between audio gaps is 1436msec. This interval means 
that there is a high probability of the control unit 
being able to perform the halt control processing well 
within the deadline for the processing. 



15 (1-3-3-11) Audio Gap Location Information 

The audio gap location information "A_GAP_LOC" is 
a 3-bit value that shows into which of the three VOBs 
located at the start of the latter VOB the audio pack 
including the audio gap has been inserted. When the 

20 first bit in this value is "1", this shows the audio gap 
is present in VOBU#l. In the same way, the values "2" 
and "3" respectively show that the audio gap is present 
in VOBU#2 or VOBU#3. 

The reason this flag is necessary is that it will 

25 be necessary to regenerate the audio gap when the latter 
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of two VOBs that are to be seamlessly reproduced has 
been partially deleted. 

The partial deletion of the VOB refers to the 
deletion of a plurality of VOBUs that are located at the 
5 start or the end of a VOB. As one example, there are 

many cases during video editing when the user wishes to 
remove the opening credit sequence. The deletion of the 
VOBUs which include this opening credit sequence is 
called the "partial deletion of a VOB" . 

10 When performing partial deletion, audio packs 

including an audio gap that are moved to a latter VOB 
require special attention. As described above, the 
audio gap is determined according to the video 
presentation start time VOB_V_jS_PTM of the latter VOB, 

15 so that when some of the VOBUs are deleted from the 
latter VOB, the picture data that has the video 
presentation start time VOB_V_S_PTM that determines the 
audio gap and the VOBUs for this picture data will be 
deleted. 

20 The audio gap is multiplexed into the one of the 

first three VOBs at the start of a VOB. Accordingly, 
when a part of a VOB, such as the first VOBU, is 
deleted, it will not be clear as to whether the audio 
gap will have been destroyed as a result of this 

25 deletion. Since the number of audio gaps that may be 
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provided within one VOB is limited to one, it is also 
necessary to delete a previous audio gap that is no 
longer needed once a new audio gap has been generated. 
As shown in Fig. 14D, the audio pack G3 that 
5 includes the audio gap needs to be inserted into one of 
VOBU#l to VOBU#3 in V0B#2 so as to comply to the one- 
second rule, so that the audio pack that includes this 
audio gap needs to be taken out of the packs included in 
V0B#1 to V0B#3. While this involves a maximum of three 

10 VOBUs, the immediate extraction of only the audio pack 
G3 that includes the audio gap is technically very 
difficult. This means that stream analysis is required. 
Here, each VOBU includes several hundred packs so that a 
significant amount of processing is required to refer to 

15 the content of all such packs. 

The audio gap location information A_GAP_LOC uses 
a 3-bit flag to show into which of the three VOBUs at 
the start of a latter VOB an audio pack including an 
audio gap has been inserted, so that only one VOBU needs 

20 to be searched when looking for the audio gap. This 
facilitates the extraction of the audio pack G3 
including the audio gap. 

Fig. 15A to 15E show the procedure for the 
regeneration of the audio gap by the video data editing 

25 apparatus when the VOBUs located at the start of VOB#2 
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have been deleted, out of two VOBs, V0B#1 and V0B#2, 
that are to be reproduced seamlessly. 

As shown in Fig. 15A, the VOBUs, "VOBU#98", 
"VOBU#99", and "VOBU#100" are located at the end of 
5 V0B#1 and the VOBUs, "VOBU#l", "V0BU#2", and "V0BU#3" 

are located at the start of V0B#2 . In this example, the 
user instructs the video data editing apparatus to 
perform a partial deletion to delete V0BU#1 and V0BU#2 
in V0B#2. 

10 In this case, the audio pack G3 that includes the 

audio gap is required, out of the audio data stored in 
VOBU#100, but it is known for certain that this audio 
pack G3 including the audio gap will be arranged into 
one of VOBU#l, VOBU#2, and VOBU#3 in VOB#2. To find the 

15 VOBU into which the audio pack G3 including the audio 

gap has been arranged, the video data editing apparatus 
refers to the audio gap location information A_GAP_LOC. 
When the audio gap location information A_GAP_LOC is set 
as shown in Fig. 15B, it can be seen that the audio pack 

20 G3 including the audio gap is located in VOBU#3 in 
VOB#2 . 

Once the video data editing apparatus knows that 
the audio pack G3 including the audio gap is located in 
VOBU#3, the video data editing apparatus will know 
25 whether the audio gap was multiplexed into the area that 
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was subjected to the partial deletion. In the present 
example, the audio gap is not included in the deleted 
area, so that the value of A_GAP_LOC is only amended by 
the number of VOBU that were deleted. 
5 This completes the explanation of the VOBs, video 

stream, audio stream, and VOB information that is stored 
on an optical disc for the present invention. 

(1-4) System Construction of the Video Data Editing 

10 Apparatus 

The video data editing apparatus of the present 
embodiment is provided with functions for both a DVD-RAM 
reproduction apparatus and a DVD-RAM recording 
apparatus. Fig. 16 shows an example of the system 

15 construction that includes the video data editing 

apparatus of the present embodiment. As shown in Fig. 
16, this system includes a video data editing apparatus 
(hereinafter DVD recorder 70), a remote controller 71, a 
TV monitor 72 that is connected to the DVD recorder 70, 

20 and an antenna 73. The DVD recorder 70 is conceived as 
a device to be used in place of a conventional video 
cassette recorder for the recording of television 
broadcasts, but also features editing functions. The 
system illustrated in Fig. 16 shows the case when the 

25 DVD recorder 70 is used as a domestic video editing 
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apparatus. The DVD-RAM described above is used by the 
DVD recorder 70 as the recording medium for recording 
television broadcasts. 

When a DVD-RAM is loaded into the DVD recorder 
5 70, the DVD recorder 70 compresses a video signal 
received via the antenna 73 or a conventional NTSC 
signal and records the result onto the DVD-RAM as VOBs. 
The DVD recorder 70 also decompresses the video streams 
and audio streams included in the VOBs recorded on a 
10 DVD-RAM and outputs the resulting video signal or NTSC 
signal and audio signal to the TV monitor 72. 



(1-4-1) Hardware Construction of the DVD Recorder 70 

Fig. 17 is a block diagram showing the hardware 

15 construction of the DVD recorder 70. As shown in Fig. 

17, the DVD recorder 70 is composed of a control unit 1, 
an MPEG encoder 2, a disc access unit 3, an MPEG decoder 
4, a video signal processing unit 5, a remote controller 
71, a bus 7, a remote control signal reception unit 8, 

20 and a receiver 9. 

The arrows drawn with solid lines in Fig. 17 show 
the physical connections that are achieved by the 
circuit wiring inside the DVD recorder 70. The broken 
lines, meanwhile, show the logical connections that 

25 indicate the input and output of various kinds of data 
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on the connections shown with the solid lines during a 
video editing operation. The numerals (1) to (5) 
assigned to the broken lines show how VOBUs and the 
picture data and audio data that composes VOBUs are 
5 transferred on the physical connections when the DVD 
recorder 70 re-encodes VOBUs. 

The control unit 1 is the host-side control unit 
that includes the CPU la, the processor bus lb, the bus 
interface lc, the main storage Id, and the ROM le. By 

10 executing programs stored in the ROM le, the control 
unit 1 records, reproduces, and edits VOBs . 

The MPEG encoder 2 operates as follows. When the 
receiver 9 receives an NTSC signal via the antenna 73, 
or when a video signal outputted by a domestic video 

15 camera is received via the video input terminals 

provided at the back of the DVD recorder 70, the MPEG 
encoder 2 encodes the NTSC signal or video signal to 
produce VOBs and outputs the generated VOBs to the disc 
access unit 3 via the bus 7. As a process that 

20 particularly relates to video editing, the MPEG encoder 
2 receives an input of the decoding result of the MPEG 
decoder 4 from the connection line CI via the bus 7, as 
shown by the broken line (4), and outputs the encoding 
result for this data to the disc access unit 3 via the 

25 bus 7, as shown by the broken line (5) . 
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The disc access unit 3 includes a track buffer 
3a, an ECC processing unit 3b, and a drive mechanism 3c 
for a DVD-RAM, and accesses the DVD-RAM in accordance 
with control by the control unit 1. 
5 In more detail, when the control unit 1 gives an 

indication for recording on the DVD-RAM and the VOBs 
encoded by the MPEG encoder 2 have been successively 
outputted as shown by the broken line (5) , the disc 
access unit 3 stores the received VOBs in the track 
10 buffer 3a, and, once ECC processing has been performed 
by the ECC processing unit 3b, controls the drive 
mechanism 3c to successively record these VOBs onto the 
DVD-RAM. 

On the other hand, when the control unit 1 
15 indicates a data read from a DVD-RAM, the disc access 
unit 3 controls the drive mechanism 3c to successively 
read VOBs from the DVD-RAM, and, once the ECC processing 
unit 3b has performed ECC processing on these VOBs, 
stores the result in the track buffer 3a. 
20 The drive mechanism 3c mentioned here includes a 

platter for setting the DVD-RAM, a spindle motor for 
clamping and rotating the DVD-RAM, an optical pickup for 
reading a signal recorded on the DVD-RAM, and an 
actuator for the optical pickup. Reading and writing 
25 operations are achieved by controlling these components 
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of the drive mechanism 3c, although such control does 
not form part of the gist of the present invention. 
Since such control can be achieved using well-known 
methods, no further explanation will be given in this 
5 specification. 

When VOBs that have been read from the DVD-RAM by 
the disc access unit 3 are outputted as shown by the 
broken line (1), the MPEG decoder 4 decodes these VOBs 
to obtain uncompressed digital video data and an audio 

10 signal. The MPEG decoder 4 outputs the uncompressed 

digital video data to the video signal processing unit 5 
and outputs the audio signal to the TV monitor 72. 
During a video editing operation, the MPEG decoder 4 
outputs the decoding result for a video stream and audio 

15 stream to the bus 7 via the connections lines C2, C3, as 
shown by the broken lines (2) and (3) in Fig. 17. The 
decoding result outputted to the bus 7 is outputted to 
the MPEG encoder 2 via the connection line CI, as shown 
by the broken line (4). 

20 The video signal processing unit 5 converts the 

image data outputted by the MPEG decoder 4 into a video 
signal for the TV monitor 72. On receiving graphics 
data from outside, the video signal processing unit 5 
converts the graphics data into an image signal and 

25 performs signal processing to combine this image signal 
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with the video signal. 

The remote control signal reception unit 8 
receives a remote controller signal and informs the 
control unit 1 of the key code included in the signal so 
5 that the control unit 1 can perform control in 

accordance with user operations of the remote controller 
71. 

(1-4-1-1) Internal Construction of the MPEG Encoder 2 
10 Fig. 18 is a block diagram showing the 

construction of the MPEG encoder 2. As shown in Fig. 
18, the MPEG encoder 2 is composed of a video encoder 
2a, a video buffer 2b for storing the output of the 
video encoder 2a, an audio encoder 2c, an audio buffer 
15 2d for storing the output of the audio encoder 2c, a 
stream encoder 2e for multiplexing the encoded video 
stream in the video buffer 2b and the encoded audio 
stream in the audio buffer 2d, an STC (System Time 
Clock) unit 2f for generating the synchronization clock 
20 of the MPEG encoder 2, and the encoder control unit 2g 
for controlling and managing these components of the 
MPEG encoder 2 . 



25 



(1-4-1-2) Internal Construction of the MPEG decoder 4 
Fig. 19 shows the construction of the MPEG 
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decoder 4. As shown in Fig. 19, the MPEG decoder 4 is 
composed of a demultiplexer 4a, a video buffer 4b, a 
video decoder 4c, an audio buffer 4d, an audio decoder 
4e, a reordering buffer 4f, an STC unit 4g, switches SW1 
5 to SW4, and a decoder control unit 4k. 

The demultiplexer 4a refers to the header of a 
packet that has been read from a VOB and judges whether 
the various packs are video packs or audio packs. The 
demultiplexer 4a outputs video data in packs judged to 

10 be video packs to the video buffer 4b and audio data in 
packs judged to be audio packs to the audio buffer 4d. 

The video buffer 4b is a buffer for accumulating 
video data that is outputted by the demultiplexer 4a. 
Each set of picture data in the video buffer 4b is 

15 stored until its decode time when it is taken from the 
video buffer 4b. 

The video decoder 4c takes out sets of picture data 
from the video buffer 4b at their respective decode 
times and instantly decodes the data. 

20 The audio buffer 4d is a buffer for accumulating 

the audio data outputted by the demultiplexer 4a. 

The audio decoder 4e successively decodes the audio 
data stored in the audio buffer 4d in frame units. On 
receiving ADPI (Audio Decoder Pause Information) issued 

25 by the control unit 1, the audio decoder 4e halts the 
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decode processing for audio frame data. The ADPI is 
issued by the control unit 1 when the present time 
reaches the audio gap start time A_STP_PTM shown by the 
seamless linking information. 
5 The reordering buffer 4f is a buffer for storing 

the decoding result of the video decoder 4c when it has 
decoded an I picture or P picture. The reason the 
decoding results for I pictures or P pictures are stored 
is that the encoding order was originally produced by 

10 rearranging the display order. Accordingly, after every 
B picture that should be displayed before the decoding 
results stored in the reordering buffer 4f has been 
decoded, the reordering buffer 4f outputs the decoding 
results of the hitherto stored I pictures and P pictures 

15 as an NTSC signal. 

The STC unit 4g generates the synchronization clock 
that shows the system clock for use in the MPEG decoder 
4 . 

The adder 4h outputs a value produced by adding the 
20 STC_offset to the standard clock shown by the 

synchronization clock as the offset standard clock. The 
control unit 1 calculates this STC__offset by finding the 
difference between the video presentation start time 
VOB_V_S_PTM and the video presentation end time 
25 VOB_V__E_PTM that are given in the seamless linking 
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information, and sets the STC__offset in the adder 4h. 

The switch SW1 supplies the demultiplexer 4a with 
the standard time measured by the STC unit 4g or the 
offset standard time outputted by the adder 4h. 
5 The switch SW2 supplies the audio decoder 4e with 

the standard time measured by the STC unit 4g or the 
offset standard time outputted by the adder 4h. The 
supplied standard time or offset standard time is used 
to collate the decode time and presentation start time 

10 of each audio frame. 

The switch SW3 supplies the video decoder 4c with 
the standard time measured by the STC unit 4g or the 
offset standard time outputted by the adder 4h. The 
supplied standard time or offset standard time is used 

15 to collate the decode time of each set of picture data. 

The switch SW4 supplies the reordering buffer 4f 
with the standard time measured by the STC unit 4g or 
the offset standard time outputted by the adder 4h. The 
supplied standard time or offset standard time is used 

20 to collate the presentation start time of each set of 
picture data. 

The decoder control unit 4k receives a decode 
processing request from the control unit 1 for an 
integer multiple of VOBUs, which is to say an integer 

25 multiple of GOPs, and has the decode processing 
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performed by all of the components from the 
demultiplexer 4a to the reordering buffer 4f . Also, on 
receiving a valid/invalid indication for the 
reproduction output of the decoding result, the decoder 
5 control unit 4k has the decoding results of the video 
decoder 4c and the audio decoder 4e outputted to the 
outside if the indication is valid, or prohibits the 
output of the decoding results of the video decoder 4c 
and the audio decoder 4e to the outside if the 

10 indication is invalid. 

The valid/invalid indication can be given for a 
smaller unit that a video stream, such as for a video 
field. Information that indicates the valid section of 
the reproduction output in video field units is called 

15 valid reproduction section information. 

(1-4-1-2-1) Timing for t he Switching of Switches SW1 ~SW4 

Fig. 20 is a timing chart of the timing for the 
switching of switches SW1 to SW4 . This timing chart 

20 shows the switching of switches SW1 to SW4 when seamless 
reproduction of VOB#l and V0B#2 is performed. The upper 
part of Fig. 20 shows the pack sequences that compose 
VOB#l and VOB#2, while the middle part shows the video 
frames and the lower part shows the audio frames. 

25 The timing for the switching of switch SW1 is the 
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point where the pack sequence that is transferred to the 
MPEG decoder 4 changes from V0B#1 to V0B#2 . This time 
is indicated as the Last_SCR in the seamless linking 
information of V0B#1. 
5 The timing for the switching of switch SW2 is the 

point where the all of the audio data in the VOB that is 
stored in the audio buffer 4d before the switching of 
switch SW1, which is to say V0B#1, has been decoded. 

The timing for the switching of switch SW3 is the 
10 point where the all of the video data in the VOB that is 
stored in the video buffer 4b before the switching time 
(Tl) of switch SW#1, which is to say VOB#l, has been 
decoded. 

The timing for the switching of switch SW4 is the 
15 point during the reproduction of VOB#l where the last 
video frame has been reproduced. 

The programs stored in the ROM le include modules 
that enable two VOBs that have been recorded on the DVD- 
RAM to be reproduced seamlessly. 

20 

(1-4-1-2-2) Procedure for t he Seamless Processing of 

VOBs 

Figs. 21 and 22 are flowcharts showing the 
procedure that seamlessly links two VOBs in an AV file. 
25 Figs. 23A and 23B show an analysis of the buffer state 
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for each video pack. Figs. 24A and 25 show the audio 
frames in the audio stream that correspond to the audio 
frames x, x+1, y-1, y, u+1, u+2, and u+3 mentioned in 
Fig. 22. 

5 The following is an explanation of the re-encoding 

of VOBs. In step S102 of Fig. 21, the control unit 1 
performs the calculation VOB_V_E__PTM of former VOB minus 
VOB_V_S_PTM of latter VOB to obtain the STC_offset. 
In step S103, the control unit 1 analyzes the 

10 changes in the occupancy of the buffer from the 

First__SCR of the former VOB to the decode end time of 
all of the data in the former VOB. Figs. 23A and 23B 
show the analysis process for the occupancy of the 
buffer performed in step S103. 

15 When video pack #1 and video pack #2 are included 

in the former VOB as shown in Fig. 23A, the SCR#1, 
SCR#2, and DTS#1 included in these video packs are 
plotted on the time axis. After this, the data size of 
the data included in video pack #1 and video pack #2 is 

20 calculated. 

A line is plotted starting from SCR#1 with the 
bitrate information in the pack header as the gradient, 
until the data size of video pack #1 has been plotted. 
After this, the data size of video pack #2 is plotted 

25 starting from SCR#2 . Next, the data size of the picture 
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data PI that is to be decoded is removed at DTS#1. This 
data size of picture data PI is obtained by analyzing 
the bitstream. 

After plotting the data sizes of the video packs 
5 and picture data in this way, the buffer state of the 
video buffer 4b from the first SCR to the DTS can be 
plotted as a graph. By using the same procedure for all 
of the video data and audio data in a VOB, a graph 
showing the state of the buffer can be obtained, as 

10 shown in Fig. 23B. 

In step S104, the control unit 1 performs the same 
analysis as in step S103 for the latter VOB, and so 
analyzes the changes in the occupancy of the video 
buffer from the First_SCR of the latter VOB to the 

15 decode end time Last_DTS of all the data in the latter 
VOB. 

In step S105, the control unit 1 analyzes the 
changes in the occupancy of the video buffer from the 
First_SCR of the latter VOB + STC_offset to the Last__DTS 

20 of the former VOB. This period from the First_SCR of 

the latter VOB + STC_offset to the Last_DTS of the data 
in the former VOB is when the first picture data of the 
latter VOB is being transferred to the video buffer 4b 
while the last picture data of the former VOB is still 

25 stored in the video buffer 4b. 
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When the video data of the former VOB and the 
latter VOB coexist in the buffer, the buffer state will 
be as shown in Fig. IOC. In Fig. IOC, the video buffer 
4b stores video data of both the former VOB and the 
5 latter VOB during the period from the 

First_SCR+STC_of f set to the Last_DTS, with Bvl+Bv2 
representing the highest occupancy of the video buffer 
4b during this period. 

In step S106, the control unit 1 controls the disc 

10 access unit 3 to read the three VOBUs that are located 
at the end of the former VOB. After this, in step S107 
the control unit 1 controls the disc access unit 3 to 
read the three VOBUs that are located at the front of 
the latter VOB. 

15 Fig. 23C shows the area that should be read from 

the former VOB in step S106. In Fig. 23C, the former 
VOB includes the VOBUs #98~#105, so that the VOBUs #103 
to #105 are read as the VOBUs that include the picture 
data V_END that should be decoded last. Fig. 23D, shows 

20 the area that should be read from the latter VOB in step 
S107. In Fig. 23D, the former VOB includes the VOBUs 
#1~#7, so that when the VOBU #1 comes first, VOBUs #1 
to #3 should be read as the VOBUs that include the 
picture data VJTOP. 

25 According to the one-second rule, there is a 
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possibility that the audio data and picture data that 
should be reproduced within one second are stored across 
three VOBUs, so that by reading the three VOBUs at the 
start and end of a VOB, in step S106, all of the picture 
5 data and audio data to be reproduced between a point one 
second from the presentation end time of the picture 
data V_END located at the end of the former VOB and this 
presentation end time itself can be read together. 

Also, in step S107, all of the picture data and 

10 audio data to be reproduced between the presentation 
start time of the picture data V_TOP located at the 
start of the latter VOB and a point one second after 
this presentation start time can be read together. It 
should be noted that the reads in this flowchart are 

15 performed for VOBU units, although the reads may instead 
be performed for the picture data and audio data that is 
to be reproduced in one second, out of all of the 
picture data and audio data included in a VOBU. In this 
embodiment, the number of VOBUs that correspond to one 

20 second is three, although any number of VOBUs may be re- 
encoded. Reads may alternatively be performed for 
picture data and audio data that is to be reproduced in 
a period longer than one second. 

Next, in step S108 the control unit 1 controls the 

25 demultiplexer 4a to separate the VOBUs for the first 
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part and the last part into a video stream and an audio 
stream, and has the video decoder 4c and the audio 
decoder 4e decode these streams. During normal 
reproduction, the decoding results of the video decoder 
5 4c and the audio decoder 4e will be outputted as video 
and audio. When re-encoding is performed, however, 
these decoding results should inputted into the MPEG 
encoder 2, so that the control unit 1 has the video 
stream and the audio stream of the decoding results 

10 output to the bus 7, as shown by the arrows (2) and (3) 
that are drawn with broken lines in Fig. 17. 

The video stream and the audio stream that are the 
decoding results are transferred via the bus 7 in order 
to the MPEG encoder 2, as shown by the broken line (4) . 

15 After this, the control unit 1 calculates the 

amount of code for the re-encoding of the decoded video 
stream and decoded audio stream by the MPEG encoder 2 . 
First, in step S109, the control unit 1 judges whether 
the accumulated amount of data in the buffer exceeds the 

20 upper limit of the buffer at any point in the decoding 
when the former VOB and the latter VOB coexist in the 
buffer. In the present embodiment, this is achieved by 
judging whether the value Bvl+Bv2 calculated in step 
S105 exceeds the upper limit of the buffer. If this 

25 value does not exceed the upper limit, the processing 
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advances to step S112, or if the value does exceed the 
upper limit, the control unit 1 subtracts the excess 
amount of code A from the calculated amount and assigns 
the resulting amount of code to the decoded VOBU 
5 sequence. 

If the amount of code is decreased, this means the 
picture quality of the video stream will decrease during 
the reproduction of these VOBUs. However, overflows in 
the video buffer 4b must be prevented when seamlessly 

10 linking two VOBs, so that this method that decreases 

picture quality is used. In step Sill, the control unit 
1 controls video decoder 4c to re-encode the decoding 
results of the video decoder 4c and the audio decoder 4e 
according to the amount of code assigned in step S110. 

15 Here, the MPEG encoder 2 performs a decode to 

temporarily convert the pixel values in the video data 
into digital data in a YUV coordinate system. Digital 
data in such a YUV coordinate system is digital data for 
the signals (luminance signal (Y) , chrominance signal 

20 (U,V) ) that specify colors for a color TV, with the 
video decoder 4c re-encoding this digital data to 
produce sets of picture data. The technique used for 
the assigning of an amount of code is that described in 
MPEG2 DIS (Draft International Standard) Test Model 3. 

25 Re-encoding to reduce the amount of code is achieved by 
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processes such as replacing the quantization 
coefficients. Note that the amount of code from which 
the excess amount A has been subtracted may be assigned 
to only the latter VOB or to only former VOB. 
5 In step S112, the control unit 1 calculates which 

part of the decoding result for the audio data taken 
from the former VOB corresponds to the audio frame x 
that includes the STC_of f set+First_SCR of the latter 
VOB. In Fig. 24A, the graph shows the buffer state for 

10 the former VOB and latter VOB, while the lower part 

shows the audio frames of the audio data separated from 
the former VOB and the audio frames of the audio data 
separated from the latter VOB. The audio frame 
sequences in the lower part of Fig. 24A show the 

15 correspondence between each audio frame and the time 
axis of the graph in the upper part. The descending 
line drawn from the point shown as First__SCR+STC_of f set 
in the graph intersects one audio frame out of the audio 
frame sequence for the former VOB. 

20 The audio frame that intersects this descending 

line is the audio frame x, and the audio frame x+1 
following immediately after is the final audio data 
included in the former VOB. It should be noted that the 
data in the audio frames x and x+1 is included in the 

25 audio data that should be reproduced during a period 
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that is indicated by points 1.0 seconds before and after 
the reproduction period of the final picture data V_END, 
with this being included in the three VOBUs read in step 
S105. 

5 Fig. 24B shows the case where the 

First_SCR+STC_of f set matches an audio frame boundary in 
the former VOB. In this case, the audio frame 
immediately before the boundary is set as the audio 
frame x . 

10 In step S113, the control unit 1 calculates the 

audio frame y+1 that includes the STC_of f set+VOB_V_S_PTM 
of the latter VOB. In Fig. 24A, the descending line 
drawn from the video presentation start time 
VOB_V_S_PTM+STC_of f set in the graph intersects one audio 

15 frame in the audio frame sequence of the latter VOB. 

The audio frame that intersects this descending line is 
the audio frame y+1. Here, the audio frames up to the 
preceding audio frame y are the valid audio frames that 
still used after the editing has been performed, out of 

20 the original audio data included in the former VOB. 
Fig. 24C shows the case where the video 
presentation start time VOB_V_S_PTM+STC_of f set matches 
an audio frame boundary in the former VOB. In this 
case, the audio frame immediately before the video 

25 presentation start time VOB_V_S_PTM+STC_of f set is set as 
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the audio frame y. 

In step S114, the audio data from the audio frame 
x+2 to the audio frame y is taken from the former audio 
data. In Fig. 24A, the audio frames from audio frame 
5 y+1 onwards have been drawn with a broken line, showing 
that this part is not multiplexed into the VOB. It 
should be noted that the audio frames that have been 
moved to the latter VOB will have been assigned time 
stamps for the former VOB, so that these audio frames 

10 are reassigned time stamps for the latter VOB. 

In step S115, the audio frame u immediately after 
the audio frame that includes the boundary between the 
audio frames y and y+1 is detected from the audio frame 
sequence of the latter VOB. When a descending line is 

15 drawn from the boundary of the audio frames y and y+1, 
this line will intersect one of the audio frames in the 
audio frame sequence of the latter VOB. The audio frame 
that follows this intersected audio frame is the audio 
frame u. 

20 Fig. 24D shows the case where the presentation end 

time of the audio frame y matches an audio frame 
boundary in the latter VOB. In this case, the audio 
frame immediately after this presentation end time is 
set as the audio frame u. 

25 In step S116, the audio pack G4 , which includes an 
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audio data sequence where the audio data reproduced for 
the audio frame u is arranged at the front is generated 
from the audio stream in the latter VOB. In Fig. 24A, 
the audio frames that precede audio frame u have been 
5 drawn with a broken line, with this audio data shown 
using a broken line not being multiplexed into the 
latter VOB. 

As a result of steps S114-S116 above, the audio 
data from the first audio frame to the audio frame x+1 

10 is multiplexed into the former VOB. The audio data from 
the audio frame x+2 to the audio frame y and the audio 
data from the audio frame u to the final audio frame is 
multiplexed into the latter VOB. By performing 
multiplexing in this way, the audio frames for the audio 

15 data at the end of the former VOB will be read from the 
DVD-RAM at the same time as picture data that is to be 
reproduced further ahead in the reproduction. 

At this point, when the audio data in the former 
VOB is not present as far as frame y, which is to say 

20 the audio data is short, silent audio frame data is 

inserted to compensate for the insufficient number of 
frames. In the same way, when the audio data in the 
latter VOB is not present starting from audio frame u, 
which is to say the audio data is short, silent audio 

25 frame data is inserted to compensate for the 
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insufficient number of frames. 

When the audio frames from the audio frame x+2 to 
the audio frame y in the former VOB and the audio data 
from the audio frame u to the final audio frame in the 
5 latter VOB is multiplexed into the latter VOB, attention 
needs to paid to the AV synchronization. 

As shown in Fig. 24A, an reproduction gap occurs 
between the audio frame y and the audio frame u, and if 
multiplexing is performed without regard to this 
10 reproduction gap, a loss of synchronization will occur 
whereby the audio frame u will be reproduced before the 
corresponding video frame. 

To prevent the increase of such time lags between 
audio and video, a time stamp showing the audio frame u 
15 may be assigned to the audio packet. 

To do so, in step S117, a Padding-Packet or 
stuffing bytes are inserted into the pack which includes 
the data of the audio frame y so that the audio frame u 
is not stored in the pack storing the audio frame y. As 
20 a result, the audio frame u is located at the start of 
the next pack. 

In step S118, the VOBU sequence that is located at 
the end of the former VOB is generated by multiplexing 
the audio data up to the audio frame x+1, out of the 
25 audio data extracted from the VOBUs located at the end 
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of this former VOB, with the video data that has been 
re-encoded. 

In step S119, the audio data in audio frame x+2 
onwards is multiplexed with the video data that is 
5 extracted from the VOBUs located at the start of the 

latter VOB to generate the VOBUs that should be arranged 
at the front of the latter VOB. 

In detail, the control unit 1 has the audio pack 
G3, which includes the audio data sequence from the 

10 first audio frame x+2 to the audio frame y and the 

Padding_Packet , and the audio pack G4 , which includes 
the audio data sequence from the audio frame u onwards 
in the latter VOB, multiplexed with the re-encoded 
video data and has the stream encoder 2e generate the 

15 VOBUs that are to be arranged at the start of the latter 
VOB. As a result of this multiplexing, the audio frames 
at the end of the audio data of the former VOB will be 
read from the DVD-RAM at the same time as sets of 
picture data that will be reproduced at a later time. 

20 Fig. 25 shows how the audio packs that store a 

plurality of sets of audio data to be reproduced for a 
plurality of audio frames are multiplexed with video 
packs that store picture data that is to be reproduced 
for a plurality of video frames. 

25 In Fig. 25, the transfer of the picture data V TOP 
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that should be decoded at the start of the latter VOB 
will be completed within the period Tf_Period. The pack 
sequence arranged below this period Tf_Period in Fig. 25 
shows the packs that compose the picture data V_TOP. 
5 In Fig. 25 , the audio pack G3 that includes the 

audio gap stores the sets of audio data x+2, y-1, y that 
are to be reproduced for the audio frames x+2, y-1, y. 
Of the sets of audio data stored in this audio pack, the 
first to be decoded is the audio data x+2. 

10 This audio data x+2 should be decoded at the 

presentation end time of the audio frame x+1, and so 
should be read from the DVD-RAM together with the 
picture data V_TOP whose pack sequence is transferred 
during the same period (Tf_Period) as the audio frame 

15 x+1. As a result, this audio data is inserted between 
the video pack sequence P51, which stores the picture 
data V_TOP, and the video pack sequence P52, as shown at 
the bottom of Fig. 25. 

In the audio pack G4 that stores the sets of audio 

20 data u, u+1, and u+2 that are to be reproduced for the 
audio frames u, u+1, and u+2, the audio data u is to be 
decoded first. This audio data u should be decoded at 
the presentation end time of the audio frame u-1, so 
that this audio data u should be read from the DVD-RAM 

25 together with the picture data V_NXT whose pack sequence 
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is transferred during the same period. As a result, 
this audio data u is inserted between the video pack 
sequence P52, which stores the picture data V_TOP, and 
the video pack sequence P53 which stores the picture dat 
5 V_NXT, as shown at the bottom of Fig. 25. 

As shown above, the audio pack G3 that includes the 
audio gap is inserted between the video pack sequences 
P51 and P52, while the audio pack G4 is inserted between 
the video pack sequences P52 and P53, thereby completing 

10 the multiplexing. 

After this, in step S120 the control unit 1 inserts 
the First_SCR and Last_SCR of the former VOB and latter 
VOB, the seamless flag, the VOB_V_E_PTM, and the 
VOB_V_S_PTM into the seamless linking information for 

15 the former VOB. In steps S121 and S122, the control 
unit 1 writes all of the information relating to the 
audio gap, which is to say the audio gap start time, 
A_STP_PTM, the audio gap length A_GAP_LEN, and the audio 
gap location information A__GAP_LOC into the seamless 

20 linking information. 

After the above processing, the control unit 1 has 
the end of the former VOB, the start of the latter VOB, 
and the seamless linking information written onto the 
DVD-RAM. 

25 The video packs and audio packs that store the 
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video data and audio data obtained through the above re- 
encoding are assigned SCRs with ascending values. The 
initial value of the assigned SCRs is the value of the 
SCR of the pack originally located at the start of the 
5 area subjected to the re-encoding. 

Since the SCRs show the time at which the 
respective video packs and audio packs should be 
inputted into the video buffer 4b and the video decoder 
4c, if there is a change in the amount of data before 

10 and after re-encoding, it will be necessary to update 
the values of the SCRs. Even if this is the case, 
however, the decoding process will still be carried out 
correctly provided that the SCRs for the re-encoded 
first part of the latter VOB are below the SCRs of the 

15 video packs in the remaining part of the latter VOB that 
was not re-encoded. 

The PTS and the DTS are assigned in accordance with 
the video frames and audio frames, so that there will be 
no significant change in their values when re-encoding 

20 is performed. As a result, continuity of the DTS-PTS is 
maintained between the data not subjected to re-encoding 
and the data in the re-encoded area. 

To reproduce two VOBs seamlessly, non-continuity in 
the time stamps must be avoided. To do so, the control 

25 unit 1 judges in step S123 of Fig. 22 whether 
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overlapping SCRs have appeared. If this judgement is 
negative, the processing in the flowchart of Fig. 22 
ends. If overlapping SCRs have appeared, the control 
unit 1 proceeds to step S124 where it calculates the 
5 excess amount A based on the number of packs that have 
the overlapping SCRs. The control unit 1 then returns 
to step S110 to repeat the re-encoding, basing the 
amount of assigned code for the repeated re-encoding on 
this excess amount A. 
10 As shown by the arrow (5) in Fig. 17, the six VOBUs 

that have been newly multiplexed by the processing in 
Fig. 22 are outputted to the disc access unit 3. The 
disc access unit 3 then writes the VOBU sequence onto 
the DVD-RAM . 

15 It should be noted that while the flowchart of Fig. 

21-Fig. 22 describes the seamless linking of two VOBs, 
the same processing may be used to link two sections of 
the same VOB. For the example shown in Fig. 6B, when 
deleting the VOBUs #2, #4, #6, and #8, the VOBU located 

20 before each deleted part may be seamlessly linked to the 
VOBU located after the deleted pack by the processing in 
Figs. 21 and 22. 

The following is a description of the reproduction 
procedure for seamlessly reproducing two VOBs that have 

25 been seamlessly linked by the processing described 
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above . 

When the user indicates the seamless reproduction 
of two or more VOBs recorded in an AV file, the control 
unit 1 first refers to the seamless flag in the seamless 
5 linking information of the latter VOB. If this seamless 
flag is "on", the control unit 1 sets the time obtained 
by subtracting the video presentation start time 
VOB_V_S_PTM of the latter VOB from the video 
presentation end time VOB_V_JE_PTM of the former VOB to 

10 obtain the STC_offset. The control unit 1 then has the 
adder 4h add the STC_offset to the standard time 
measured by STC unit 4g. 

After this, the buffer input time First_SCR of the 
former VOB indicated by the seamless linking information 

15 is compared with the standard time measured by the STC 

unit 4g. When the standard time reaches this FirstJSCR, 
the control unit 1 controls the switch SW1 to switch to 
output the offset standard time outputted by the adder 
4h instead of the standard time outputted by the STC 

20 unit 4g. After this, the control unit 1 switches the 
states of the switches SW2~SW4 in accordance with the 
timing chart in Fig. 20. 

With the present embodiment, seamless reproduction 
of a plurality of VOBs can be achieved by reading and 

25 re-encoding only the respective ends and starts of the 

121 



WO 99/14935 



PCT/JP98/04174 



VOBs. Since the re-encoded data is only the VOBUs 
located at the start and end of the VOBs, the re- 
encoding of VOBs can be achieved in a very short time. 
Note that while the present embodiment describes a 
5 case where seamless linking information is managed for 
each VOB, the information that is required for the 
seamless linking of VOBs may be collectively provided. 
As one example, the video presentation end time 
VOB_V_E_PTM and the video presentation start time 

10 VOB_V_S_PTM that are used to calculate the STC_offset 

are described as being given in two separate sets of VOB 
information, though these may be given as the seamless 
linking information of the latter VOB. When doing so, 
it is desirable for the VOB information to include 

15 information for the presentation end time of the 
previous VOB ( PREV_VOB_V_E_PTM) . 

In the same way, it is preferable for information 
that is the final SCR in the former VOB 
(PREV_VOB_LAST_SCR) to be included in the seamless 

20 linking information of the latter VOB. 

In the present embodiment, the DVD recorder 7 0 was 
described as being a device that takes the place of a 
conventional (non-portable) domestic VCR, although when 
a DVD-RAM is used as the recording medium for a 

25 computer, the following system setup may be used. The 
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disc access unit 3 may function as a DVD-RAM drive 
device, and may be connected to a computer bus via an 
interface that complies to SCSI, IDE or IEEE1394 
standard. 

5 In such a case, the DVD recorder 70 will include a 

control unit 1, an MPEG encoder 2, a disc access unit 3, 
an MPEG decoder 4, a video signal processing unit 5, a 
remote controller 71, a bus 7, a remote control signal 
reception unit 8, and a receiver 9. 

10 In the above embodiment, VOBs were described as 

being a multiplexed combination of a video stream and an 
audio stream, although sub-picture data produced by 
subjecting data for subtitles to run-length encoding may 
also be multiplexed into VOBs. A video stream composed 

15 of sets of still image data may also be multiplexed. 

In addition, the above embodiment describes the 
case where the re-encoding of data is performed by the 
MPEG decoder 4 after the VOBs have been decoded by the 
MPEG encoder 2. However, during the re-encoding the 

20 VOBs may instead be directly inputted from the disc 
access unit 3 to the MPEG encoder 2 without prior 
decoding . 

The present embodiment describes the case where one 
picture is depicted using one frame, although there are 
25 cases one picture is in fact depicted using 1.5 frames, 

123 



WO 99/14935 



PCT/JP98/04174 



such as for a video stream where 3:2 pulldown is used 
with images for 24 frame per second being subject to 
compression, inn the same way as with film materials. 
The processing module software represented by the 
5 flowcharts in this first embodiment (Figs. 21-22) may be 
realized by a machine language program which may be 
distributed and sold having been recorded on a recording 
medium. Examples of such recording medium are an IC 
card, an optical disc, or a floppy disc. The machine 

10 language program recorded on the recording medium may 

then be installed into a standard personal computer. By 
executing the installed machine language programs, the 
standard personal computer can achieve the functions of 
the video data editing apparatus of the present 

15 embodiment. 



Second E mbodiment 

While the first embodiment deals with a premise 

20 that seamless linking is performed for VOBs, this second 
embodiment describes the seamless linking of a plurality 
of parts of VOBs. In this second embodiment, these 
parts of a VOB are specified using time information that 
indicates video fields. The video fields referred to 

25 here are units that are smaller than one video frame, 
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with the time information for video fields being 
expressed using the PTS of video packs. 

The parts of a VOB that are specified using time 
information for video fields are called cells, and the 
5 information used for indicating these cells is called 
cell information. Cell information is recorded in the 
RTRW management file as one element in the PGC 
information. The details of the data construction and 
generation of cell information and PGC information is 

10 given in the fourth embodiment. 

Fig. 2 6 shows examples of the cells indicated by 
the video fields for the start and the end. In Fig. 26, 
the sets of time information C__V_S_PTM, C_V_E PTM 
specify the video fields at the start and end of a cell. 

15 In Fig. 2 6, the time information C_V_S_PTM is the 

presentation start time of a video field at which the P 
picture in VOBU#100 that forms one part of the present 
VOB should be reproduced. In the same way, the time 
information C_V_E_PTM is the presentation end time of a 

20 video field at which the Bl picture in VOBU#105 that 

forms one part of the same VOB should be reproduced. As 
shown in Fig. 2 6, the time information C_V_S_PTM and 
C_V_E_PTM specify a section from a P picture to a B 
picture as a cell. 
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(2-1) Reconstruction of GOPs 

When seamlessly linking parts of a VOB that are 
indicated by time information, it becomes necessary to 
use two processes that were not required in the first 
5 embodiment. First, the construction of the GOPs has to 
be reconstructed to convert the section indicated by the 
time information into a separate VOB, and second, the 
increases in buffer occupancy due to the reconstruction 
of GOPs have to be estimated. 

10 The reconstruction of GOPs refers to a process that 

changes the construction of GOPs so that the section 
indicated as a cell has the proper display order and 
coding order. 

More specifically, when a section to be linked is 

15 indicated by cell information, there can be cases where 
an editing boundary is defined midway through a VOBU, as 
shown in Fig. 28A. If this is the case, the two cells 
to be linked will not have a proper display order or 
coding order. 

20 In order to rectify the display order and coding 

order, the reconstruction of GOPs is performed using 
processing based on the three rules shown in Fig. 28B. 

When the final picture data in the display order of 
a former cell is a B picture, the processing based on 

25 the first rule re-encodes this picture data to convert 
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it into a P picture (or an I picture) . The P picture in 
the forward direction that was referred to by the B 
picture is located before the B picture in the coding 
order. However, this P picture will not be displayed 
5 after the editing, and so is deleted from the VOB. 

When the first picture data in the encoding order 
of the latter cell is a P picture, the processing based 
on the second rule re-encodes this picture data to 
convert it to an I picture. 

10 When the first set or consecutive sets of picture 

data in the display order of the latter cell is/are B 
pictures, the processing based on the third rule re- 
encodes this picture data to convert it to picture data 
whose display does not rely on the correlation with 

15 other images that have previously been reproduced. 

Hereinafter, images formed of picture data that only 
relies on correlation with images that are yet to be 
displayed will called Forward-B pictures. 

20 (2-2) Estimating the Increase in Buffer Occupancy 

When the picture types of certain images have been 
changed by the processing based on the three rules 
described above, the processing for estimating the 
increases in buffer occupancy estimates the sizes of 

25 these converted sets of picture data. 
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When the reconstruction described above is 
performed for the former cell, the final picture data in 
the reproduction order of the former cell is converted 
from a B picture to a P picture or an I picture, thereby 
5 increasing the size of this data. 

When the reconstruction described above is 
performed for the latter cell, the picture data located 
at the start of the coding order of the final cell is 
converted from a P picture to an I picture, and the 

10 picture type of the video data located at the front of 
the display order is converted to a Forward-B picture. 
This also increases the size of the data. 

The following is an explanation of the procedure 
for estimating the increases in data size that accompany 

15 the conversion in picture type. Fig. 29A and 29B will 
be used to explain this procedure. 

In Fig. 29A, the first cell continues as far as the 
B picture B3 . According to the above rules, the video 
data editing apparatus has to convert this B picture B3 

20 to the P picture PI. When the B picture B3 is dependent 
on the P picture P2 that is reproduced after the B 
picture B3, the picture type conversion process will 
incorporate the necessary information of the P picture 
P2 into the P picture PI 1 that is produced by the 

25 conversion process. 
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In view of this procedure, the video data editing 
apparatus can estimate the data size of the P picture 
PI ! that is obtained by the conversion process using a 
sum of the size of the B picture B3 and the size of the 
5 P picture P2 . This estimation method merely represents 
one potential method, however, so that other methods are 
equally possible. By determining the amount of code for 
use in re-encoding based on the estimated buffer 
occupancy, the video data editing apparatus can assign 
10 an optimal amount of code to the former cell and latter 
cell . 

Figs. 30A and 30B show how the increases in buffer 
occupancy that accompany changes in picture type within 
the latter cell are estimated. 

15 In Fig. 30A, the data from the B picture B3 onwards 

belongs to the latter VOB . Each cell is determined 
based on the display time for the start of the cell, so 
that the B picture B3 is the picture data located at the 
start of the display order of the latter cell. As a 

20 result, the video data editing apparatus needs to 

convert the B picture B3 into the Forward-B picture B ! 
according to the rules given above. When this B picture 
B3 had an information component that is dependent on the 
previously reproduced P picture P2, this information 

25 component of the P picture P2 will have been 
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incorporated into the Forward-B B f picture during the 
picture type conversion. 

In view of this procedure, the video data editing 
apparatus can estimate the data size of the Forward-B 
5 picture B ! that is obtained by the conversion process 

using a sum of the size of the B picture B3 and the size 
of the P picture P2 . 

For the latter VOB, the video data editing 
apparatus needs to convert the picture type of the 

10 picture data located at the start of the coding order. 

By referring to the display order of the latter VOB in 
Fig. 28A, it can be seen that the P picture P3 is the 
picture data that is to be displayed immediately after 
the B picture B3 . The P picture P3 is stored in the 

15 reordering buffer 4f of the video data editing apparatus 
until the decoding of the B picture B3 is complete, and 
so is only displayed after the decoding of B picture B3 
has been performed. By having the reordering buffer 4f 
reorder the picture data in this way, the P picture P3 

20 will precede the B picture B3 in the coding order even 

though P picture P3 is displayed after the B picture B3 . 
According to the rules described earlier, the video data 
editing apparatus needs to convert picture data P3 
detected as the first picture data in the coding order 

25 into an I picture. When this P picture has an 
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information component that relies on the I picture that 
is reproduced before the P picture P3, this information 
component of the I picture will have been incorporated 
into the P picture P3 during the picture type 
5 conversion. 

In view of this procedure, the video data editing 
apparatus can estimate the data size of the I picture I f 
that is obtained by the conversion process using a sum 
of the size of the P picture P3 and the size of the 
10 preceding I picture. Based on the buffer occupancy that 
is estimated in this way, the video data editing 
apparatus can then assign optimal amounts of code to the 
former and latter cells to be used in the re- 

encoding . 

15 

(2-3) Procedure for Seamlessly Connecting Cells 

Figs. 31 to 33 are flowcharts showing the procedure 
that links two cells to enable seamless reproduction of 
the two. Note that many of the steps in these 

20 flowcharts are the same as the steps in the flowcharts 

shown in Figs. 21 and 22 with the term "VOB" having been 
replaced with term "cell". These steps have been given 
the same reference numerals as in the first embodiment, 
and their explanation has been omitted. 

25 Fig. 34 shows the audio frames in the audio stream 
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that correspond to the audio frame x, the audio frame 
x+1, and the audio frame y that are used in Fig. 31. 

In step S102, the control unit 1 refers to the time 
information specifying the end of the cell to be 
5 reproduced first (hereinafter called the "former cell") 
and the time information specifying the end of the cell 
to be reproduced second (hereinafter called the "latter 
cell") and subtracts the C_VJS_PTM of the latter cell 
from the C_V_E_PTM of the former cell to obtain the 

10 STCjDff set . 

In step S103, the control unit 1 analyzes the 
changes in the buffer occupancy from the First_SCR of 
the former cell to the decode end time Last_DTS of all 
of the data in the former cell. 

15 In step S104, the control unit 1 performs the same 

analysis as in step S103 for the latter cell, and so 
analyzes the changes in the buffer occupancy from the 
First_SCR of the latter cell to the decode end time 
LastJDTS of all of the data in the latter cell. 

20 In step S130, the control unit 1 estimates the 

increase a in the buffer occupancy that accompanies the 
changes in picture type for the latter cell, in 
accordance with the procedure shown in Figs. 30A and 
30B. In step S131, the control unit 1 estimates the 

25 increase 3 in the buffer occupancy that accompanies the 
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changes in picture type for the former cell, in 
accordance with the procedure shown in Figs. 2 9A and 
29B. In step S132, the control unit 1 adds the 
estimated increases a, 3 to the respective buffer 
5 occupancy for the latter and former cells. 

In step S105, the control unit 1 analyzes the 
changes in the occupancy of the video buffer from the 
First_SCR of the latter cell + STC_offset to the 
Last_DTS of the former cell. 
10 As shown in Fig. 10C of the first embodiment, the 

highest occupancy Bvl+Bv2 of the video buffer 4b is 
obtained for the period where video data for both the 
former cell and latter cell is stored in the video 
buffer 4b. 

15 In step S106, the control unit 1 controls the disc 

access unit 3 to read the three VOBs believed to include 
the picture data located at the end of the former cell 
from the DVD-RAM. After this, in step S107 the control 
unit 1 controls the disc access unit 3 to read the three 

20 VOBs believed to include the picture data located at the 
start of the latter cell. 

Fig. 27A shows the area that should be read from 
the former cell in step S106. Fig. 27B shows the VOB 
includes VOBUs #98 to #107, with VOBUs #99 to #105 being 

25 indicated as the former cell. When the picture data to 
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be reproduced last in the former cell is the picture 
data Bend, this picture data will be included in one of 
VOBUs #103 to #105 in accordance with the one-second 
rule, so that VOBU#103 to VOB#105 will be read as the 
5 VOBU sequence that includes the picture data to be 
reproduced last. 

The VOB shown in Fig. 27B includes the VOBUs #4 98 
to #507, and of these, VOBUs #500 to #506 are indicated 
as the latter cell. When the picture data to be 

10 displayed first in this latter cell is the picture data 
PTOP, this picture data PTOP will be included in VOBUs 
#500 to #502, so that VOBUs #500 to #502 will be read as 
the VOBU sequence that includes picture data to be 
displayed first. These VOBUs include the all of the 

15 picture data that depends on the picture data PTOP and 
the picture data Bend, in addition to the audio data 
that is to be reproduced at the same time as the picture 
data PTOP and the picture data Bend. As a result, all 
of the picture data that is required for the conversion 

20 of picture types is read by this operation. 

It should be noted that the reads in this flowchart 
are performed for VOBU units, although the reads may 
instead be performed for the picture data and audio data 
that is to be reproduced in one second, out of all of 

25 the picture data and audio data included in a VOBU. In 
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the present embodiment, the number of VOBs that 
correspond to one second of reproduction is given as 
three, although any number of VOBs may be used. Reads 
may alternatively be performed for picture data and 
5 audio data that is to be reproduced in a period longer 
than one second. 

After these reads are complete, in step S108 the 
control unit 1 controls the demultiplexer 4a to separate 
the video data and audio data from the VOBU located at 
10 the end of the former cell and the start of the latter 
cell . 

In step S109, the control unit 1 judges whether 
the accumulated amount of data in the buffer exceeds the 
upper limit of the buffer at any point in the decoding 

15 when the former cell and the latter cell coexist in the 
buffer. More specifically, this is achieved by judging 
whether the value Bvl+Bv2 calculated in step S105 
exceeds the upper limit of the buffer. 

If this value does not exceed the upper limit, the 

20 processing advances to step S133, or if the value does 
exceed the upper limit, the control unit 1 assigns an 
amount of code based on the excess amount A to the 
former cell and latter cell in Step S110. Note that the 
re-encoding performed in this case may only be performed 

25 for one of the former VOB and latter VOB, or for both. 
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In step Sill, the video data obtained from the two cells 
is re-encoded according to the amount of code assigned 
in step S110. 

In step S133, the First__SCR that has been newly 
5 assigned to the re-encoded video data in the latter cell 
is obtained. In this latter cell, the first picture 
data in the display order and the first picture data in 
the coding order will have been converted into picture 
types with larger amounts of picture data, so it should 

10 be obvious that the value First_SCR+STC_of f set will 
indicate an earlier time than before. 

In step S112, the control unit 1 calculates the 
audio data, out of the audio data separated from the 
former cell, that corresponds to the audio frame x which 

15 includes the sum of the STC_offset and the First_SCR 

that is newly assigned to the video data in the latter 
VOB. In Fig. 34, the upper and lower graphs 
respectively show the transition in the buffer occupancy 
due to the video data in the former cell and latter 

20 cell. The lower graph in Fig. 34 shows the audio frames 
of the audio data separated from the former cell. 

The audio frame sequence below the lower graph in 
Fig. 34 shows each audio frame against the time axis of 
the graph given above it. The buffer occupancy for the 

25 new latter cell obtained as a result of the re-encoding 
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increases by the amount al . Note that this amount al 
differs from the increased amount a that was estimated 
in step S132. Due to this amount al, the First_SCR that 
is newly assigned to the latter video data indicates an 
5 earlier time. 

As can be seen from the lower graph in Fig. 34, the 
new value of First_SCR+STC_of f set is positioned at time 
which is Tal earlier than before. In Fig. 34, the 
descending guideline drawn from the new value of 

10 First_SCR+STC_of f set intersects one audio frame in the 
audio frame sequence of the former cell. This 
intersected audio frame is the audio frame x, with the 
following audio frame x+1 being the final audio frame in 
the former cell. 

15 Since the value of the sum of the STC_offset and 

the new FirstjSCR of the latter cell indicates an 
earlier time, this means that an earlier frame is 
indicated as the audio frame x. As a result, when a 
read is commenced for the video data in the latter cell, 

20 the audio data that should be read from the former cell 
together with this video data is comparatively larger 
than in the first embodiment . 

Hereafter, the processing in steps S113 to S119 is 
performed so that the stream encoder 2e performs the 

25 multiplexing shown in Fig. 25. 
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After this, in step S120 the First_SCR, Last__SCR, 
the seamless flag, the C_V_E_PTM, and the C_V_S_PTM for 
the former and latter cells are inserted into the 
seamless linking information of the former cell. The 
5 control unit 1 then performs the processing in steps 

S121 and S122. Of the data for the six VOBUs obtained 
through the re-encoding, the three VOBUs arranged at the 
start (the first VOBUs) originally formed part of the 
former cell, and so are appended to the end of the 
10 former cell. Similarly, the three VOBUs arranged at the 
end (the latter VOBUs) originally formed part of the 
latter cell, and so are inserted at that start of the 
latter cell. 

While one of the former and latter cell that have 
15 been given re-encoded data is managed having been 

assigned the same identifier as the VOB from which it 
was taken, the other of the two cells is managed having 
been assigned a different identifier to the VOB from 
which it was taken. This means that after this 
20 division, the former cell and latter cell are managed as 
separate VOBs . This is because there is a high 
possibility of the time stamps not being continuous at 
the boundary between the former cell and the latter 
cell . 

25 As in the first embodiment, in step S123 the 
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control unit 1 judges whether the values of the SCR are 
continuous. If so, the control unit 1 ends the 
processing in the flowcharts of Figs. 31 to 33. If not, 
the control unit 1 calculated the excess amount A based 
5 on the number of packs given overlapping SCRs, 

determines an amount of code based on the excess amount 
A, and returns to step S109 to repeat the re-encoding. 

As a result of the above processing, cells are re- 
encoded, with the cells indicated by the cell 
10 information being set as separate VOBs . This means that 
VOB information for the newly generated VOBs need to be 
provided in the RTRW management file. The following is 
an explanation of how this VOB information for cells is 
defined. 

15 The "video stream attribute information" includes 

compression mode information, TV system information, 
aspect ratio information, and resolution information, 
although this information may be set to match the 
information for the VOB(s) from which the cells were 

20 taken. 

The "audio stream attribute information" includes 
an encoding mode, the presence/absence of dynamic range 
control, a sampling frequency, and a number of channels, 
although this information may be set to match the 
25 information for the VOB(s) from which the cells were 
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taken . 

The "time map table" is composed of the size of 
each VOBU that composes the VOB and the display period 
of each VOBU, although a corresponding part of the 
5 information given for the VOB(s) from which the cells 
were taken may be used, with the sizes and display 
periods only being amended for VOBUs that have been re- 
encoded. 

The following is an explanation of the "seamless 
10 linking information" that was generated in step S133. 

This seamless linking information is composed of a 

seamless flag, a video presentation start time 

VOB_V_S_PTM, a video presentation end time VOB_V_E_PTM, 

a First_SCR, a Last_SCR, an audio gap start time 
15 A_J3TP_PTM, and an audio gap length A_GAP__LEN . These 

elements are written into the seamless linking 

information one at a time. 

Only when the relationship between the former cell 

and the latter cell is satisfies the following 
20 conditions (1) and (2) is the seamless flag set at "01". 

If either condition is not satisfied, the seamless flag 

is set at "00." 

(1) Both cells must use the same display method 

(NTSC, PAL, etc.) for the video stream as given in the 
25 video attribute information. 
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(2) Both cells must use the same encoding method 
(AC-3, MPEG, Linear-PCM) for the audio stream as given 
in the audio attribute information. 

The "video presentation start time VOB_V_S__PTM" is 
5 updated to the presentation start time after re- 
encoding . 

The "video presentation end time VOB_V_E PTM" is 
updated to the presentation end time after re-encoding. 
The "First_SCR" is updated to the SCR of the first 
10 pack after re-encoding. 

The "Last_SCR" is updated to the SCR of the final 
pack after re-encoding. 

The "audio gap start time AJSTPJPTM" is set at the 
presentation end time of the audio frame y that is the 
15 final audio frame to be reproduced for the audio data 
that is moved to the latter cell in Fig. 34. 

The "audio gap length A_GAP__LEN" is set as the 
period from the presentation end time of the final audio 
frame y to be reproduced using the audio data that is 
20 moved to the latter cell in Fig. 34 to the presentation 
start time of the audio frame u. 

Once the VOB information has been generated as 
described above, an RTRW management file included this 
new VOB information is recorded onto the DVD-RAM. By 
25 doing so, the two cells that are indicated by the cell 
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information can be recorded on the DVD-RAM as two VOBs 
that are to be reproduced seamlessly. 

As described above, this second embodiment can 
process cells in a VOB or VOBs so as to have the cells 
5 seamlessly reproduced by merely reading and re-encoding 
the end of the former cell and the start of the latter 
cell. Since only the VOBUs located at the start and end 
of the respective cells are re-encoded, this re-encoding 
of cells can be achieved in a very short time. 

10 It should be noted that while the present 

embodiment describes the case where video fields as used 
as the unit when indicating cells, video frames may be 
used instead. 

The processing module software represented by the 

15 flowcharts in this first embodiment (Figs. 31-33) may be 
realized by a machine language program which may be 
distributed and sold having been recorded on a recording 
medium. Examples of such recording medium are an IC 
card, an optical disc, or a floppy disc. The machine 

20 language program recorded on the recording medium may 

then be installed into a standard personal computer. By 
executing the installed machine language programs, the 
standard personal computer can achieve the functions of 
the video data editing apparatus of the present 

25 embodiment. 
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Third Embodiment 

The third embodiment of the present invention manages 
AV files in a file system and allows greater freedom in 
video editing. 

5 

3-1 Directory Structure on a DVD-RAM 

The RTRW management file and AV files of the first 
embodiment are arranged in the directories shown in Fig. 35 
within a file system that complies to ISO/IEC 13346. In 
10 Fig. 35, the ovals represent directories and the rectangles 
represent files. The root directory includes directories 
called a "RTRW" and two files called "Filel.DAT" and 
"File2.DAT". The RTRW directory includes three files called 
"Moviel . VOB", "Movie 2 . VOB", and "RTRWM.IFO". 

15 

(3-1-1) File System Management Information in the 
Directories 

The following is a description of the management 
information used for managing the RTRW management file and 
20 AV files in the directory structure shown in Fig. 35. Fig. 
3 6 shows the file system management information in the 
directory structure of Fig. 35. 

Fig. 3 6 shows the volume area shown in Fig. 3D, the 
sectors, and stored contents of sectors in a hierarchy. 
2 5 Arrows ®~® in this drawing show the order in which the 

storage positions of the file "Moviel. VOB" are specified by 
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the present management information. 

The first level in the hierarchy in Fig. 36 shows the 
volume area shown in Fig. 3D. 

The second level in the hierarchy shows file set 
5 descriptors, end descriptors, file entries, and directories, 
out of the entire management information. The information 
on this second level complies with a file system that is 
standardized under ISO/IEC 13346. File systems that are 
standardized under ISO/IEC 13346 manage directories in a 
10 hierarchy. 

The management information in Fig. 3 6 is arranged in 
accordance with the directory structure. However, a 
recording region is only shown for the AV file "Moviel . VOB" . 
The file set descriptor (LBN 80) on the second level 
15 shows information such as the LBN of the sector that stores 
the file entry for the root directory. The end descriptor 
(LBN 81) shows the end of the file set descriptor. 

A file entry (such as LBN 82, 584, 3585) is stored for 
each file (or directory) , and shows a storage position for a 
20 file or directory. File entries for files and file entries 
for directories have a same format, so that hierarchical 
directories can be freely constructed. 

A directory (such as LBN83, 584, 3585) shows storage 
positions for file entries of the files or directories 
25 included in the directory. 

Three files entries and two directories are shown on 
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the third level in the hierarchy. The file entries and 
directories are tracked by the file system and have a data 
construction that enables the storage position of a 
specified file to be indicated regardless of the 
5 construction of the hierarchy in the directory structure. 

Each file entry includes an allocation descriptor that 
shows a storage position of a file or directory. When the 
data recorded in each file is divided into a plurality of 
extents, a file entry includes a plurality of allocation 

10 descriptors for each extent. 

The expression "extent" refers here to a section of 
data included in a file that should be preferably stored in 
consecutive regions. When, for example, the size of a VOB 
to be recorded in an AV file is large, but there are no 

15 consecutive regions for storing the VOB, the AV file cannot 
be recorded on the DVD-RAM . 

However, when there is a plurality of small consecutive 
regions distributed across the partition area, by dividing 
the VOBs to be recorded in the AV file, the resulting 

20 divided sections of the VOBs may be stored into the 
distributed consecutive areas. 

By dividing VOBs in this way, the probability of being 
able to store VOBs as AV files increases, even when the 
number of consecutive regions and length of the partition 

25 area are limited. To improve the efficiency with which data 
is recorded on a DVD-RAM, the VOBs recorded in one AV file 
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are divided into a plurality of extents, with these extents 
being recorded in separate consecutive areas on the disc 
without regard to the order of the extents. 

It should be noted that the expression "consecutive 
5 regions" here refers to a region composed of ECC blocks that 
are logically or physically consecutive. 

As one example, the file entries with the LBN 82 and 
584 in Fig. 36 each include a single allocation descriptor, 
which means that the file is not divided into a plurality of 

10 extents (which is to say, is composed of a single extent) . 
The file entry 3585 meanwhile has two allocation 
descriptors, which means that the data to be stored in the 
file is composed of two extents. 

Each directory includes a file identification 

15 descriptor showing a storage position of a file entry for 
each file and each directory included in the directory. 
When tracing a route through the file entries and 
directories, the storage position of the file 
"root/video/Moviel . VOB" can be found by following the order 

20 given as file set descriptor -©-file entry (root ) -©-director 
y (root) -©-file entry (RTRW) -©-directory (RTRW) -©-file 
entry (Moviel . VOB) -®®-f ile (extents #1 and #2 of 
Movie 1 .VOB) . 

Fig. 37 shows the link relationship between the file 
25 entries and directories on this route in another format that 
traces the directory construction. In this drawing, the 
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directories used for route include file identification 
descriptors for the directory of the parent directory (the 
parent of the root being the root itself) , the RTRW 
directory, the Filel.DAT file, and the File2.DAT file. The 
5 RTRW directory includes file identification descriptors for 
each of the directory of the parent directory (root) , the 
Movie 1.V0B file, the Movie2.V0B file, and the RTRWM.IFO 
file. In the same way, the storage position of the 
Movie 1.V0B file is specified by tracing the route ®-®(Z). 

10 

3-1-2 Data Construction of a File Entry 

Fig. 38A show the data construction of a file entry in 
more detail. As shown in Fig. 38A, a file entry includes a 
descriptor tag, an ICB tag, an allocation descriptor length, 
15 expanded attributes, and an allocation descriptor. In this 

figure, the legend "BP" represents "bit position", while the 
legend "RBP" represents "relative bit position". 

The descriptor tag is a tag showing the present entry 
is a file entry. For a DVD-RAM, a variety of tags are used, 
20 such as the file entry descriptor and the space bitmap 

descriptor. For a file entry, a value "261" is used as the 
descriptor tag indicating a file entry. 

The ICB tag shows attribute information for the file 
entry itself. 

25 The expanded attributes are information showing the 

attributes with a higher-level content than the content 
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specified by the attribute information field in the file 
entry. 

The allocation descriptor field stores as many 
allocation descriptors as there are extents that compose the 
5 file. Each allocation descriptor shows the logical block 
number (LBN) that indicates the storage position of an 
extent for a file or a directory. The data construction of 
an allocation descriptor is shown in Fig. 38B. 

The allocation descriptor in Fig. 38B includes data 
10 showing the extent length and a logical block number showing 
the storage position of the extent. However, the top two 
bits of the data indicating the extent length show the 
storage state of the extent storage area. The meanings of 
the various values are as shown in Fig. 38C. 

15 

(3-1-3) Data Construction of the File Identification 
Descriptors for Directories and Files 

Figs. 39A and 39B show the detailed data construction 
of the file identification descriptors for directories and 

20 files in the various directories. These two types of file 

identification descriptors have the same format, and so each 
include management information, identification information, 
a directory name length, an address showing the logical 
block number that stores the file entry for the directory or 

25 file, expansion information, and a directory name. In this 
way, the address of a file entry is associated with a 
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directory name or file name. 

(3-1-4) Minimum Size of an AV Block 

When a VOB to be recorded in an AV file is divided into 
5 a plurality of extents, the data length of each extent must 
exceed the data length of an AV block. The expression "AV 
block" here refers to the minimum amount of data for which 
there is no danger of underflow for the track buffer 3a when 
reading a VOB from the DVD-RAM. 
10 To guarantee consecutive reproduction, the minimum size 

of an AV block is defined in relation to the track buffer 
provided in a reproduction apparatus. The following 
explanation deals with how this minimum size of an AV block 
is found. 

15 

(3-1-5) Minimum Size of an AV Block Area 

First, the rationale behind the need to determine the 

minimum size of an AV block for guaranteeing uninterrupted 

reproduction is described. 
20 Fig. 40 shows a model of how a reproduction apparatus 

that reproduces video objects buffers AV data read from the 

DVD-RAM in the track buffer. This model shows the minimum 

requirements of a reproduction apparatus for uninterrupted 

reproduction to be guaranteed. 
25 In the upper part of Fig. 40, the reproduction 

apparatus subjects the AV data it reads from the DVD-RAM to 
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ECC processing, temporarily accumulates the resulting data 
in the track buffer, which is a FIFO memory, and then 
outputs the data from the track buffer to the decoder. In 
the illustrated example, V r is the input transfer rate of the 
5 track buffer (or in other words, the rate at which data is 
read from the optical disc) , and V G is the output transfer 
rate of the track buffer (decoder input rate) , where V r >V G . 
In the present model, V r =llMbps. 

The lower part of Fig. 4 0 is a graph showing the 

10 changes in the amount of data in the track buffer for the 

present model. In this graph, the vertical axis represents 
the amount of data in the buffer, while the horizontal axis 
represents time. This graph assumes that the AV block#k 
that includes a defective sector is read following the AV 

15 block#j that includes no defective sectors. 

The period Tl shown on the time axis shows the time 
required to read all AV data in the AV block#j that includes 
no defective sectors. During this period Tl, the amount of 
data in the track buffer increases at the rate (V r -V G ) . 

20 The period T2 (hereinafter called the "jump period") 

shows the time required by the optical pickup to jump from 
the AV block#j to the AV block#k. This jump period includes 
the seek time for the optical pickup and the time taken for 
the rotation of the optical disc to stabilize. In the worst 

25 case scenario of a jump from the inner periphery to the 
outer periphery of the optical disc, the jump time is 
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assumed to be around 1500ms for the present model. During 
the jump period T2, the amount of data in the track buffer 
decreases at a rate of V Q . 

The periods T3 to T5 show the time taken to read all AV 
5 data in the AV block#k that includes a defective sector. 

Of these periods, the period T4 shows the time taken to 
skip to the next ECC block from a present ECC block that 
includes a defective sector. This skip operation involves 
skipping a present ECC block if one or more of the 16 

10 sectors is defective and jumping to the next ECC block. 

This means that in an AV block, instead of merely logically 
replacing each defective sector in an ECC block with a 
replacement sector (or a replacement ECC block) , use of each 
ECC block (all 16 sectors) with a defective sector is 

15 stopped. This method is called the ECC block skip method. 

The period T4 is the disc rotation wait time, which, in the 
worse case scenario, is the time taken for one revolution of 
the disc. This is presumed to be around 105ms for the 
present model. In periods T3 and T5, the amount of data in 

20 the buffer increases at a rate given as V r -V Q , while during 
period T4, the amount decreases at the rate V Q . 

When "N_ecc" represents the total number of ECC blocks 
in an AV block, the size of an AV block is given by the 
formula "N_ecc*16*8*2048" bits. To ensure consecutive 

25 reproduction is performed, the minimum value of N_ecc is 
found as described below. 
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In period T2, AV data is only read from the track 
buffer with no concurrent replenishing of AV data. During 
this period T2, should the amount of data in the buffer 
reach zero, an underflow will occur in the decoder. In such 
5 case, the uninterrupted reproduction of AV data cannot be 
guaranteed. As a result, the relation shown as Equation 1 
below needs to be satisfied to guarantee the uninterrupted 
reproduction of AV data (which is to say, to ensure that no 
underflow occurs) . 

0 

Equation 1 

(buffered data amount B) ^ (consumed data amount R) 



The buffered data amount B is the amount of data stored 
15 in the buffer at the end of the period Tl. The consumed 
data amount R is the total amount of data read during the 
period T2 . 

The buffered data amount B is given by Equation 2 
below. 

20 

Equation 2 

(buffered data amount B) = (period Tl)*(V r -V 0 ) 

= (read time for 1 AV block) * (V r -V Q ) 
= (AV block size L/V r ) * (V r -V Q ) 
25 = (N_ecc*16*8*2048/V r ) * (V r -V D ) 

=(N_ecc*16*8*2048) * (1-V Q /V r ) 
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The consumed data amount R is given by Equation 3 
below. 

Equation 3 
5 (consumed data amount R)=T2*V D 

Substituting Equations 2 and 3 into the respective 
sides of Equation 1 gives Equation 4 below. 

10 Equation 4 

(N_ecc*16*8*2048) * (1-V c /V r ) >T2*V G 

By rearranging Equation 4, it can be seen that the 
number N_ecc of ECC blocks that guarantees consecutive 
15 reproduction must satisfy Equation 5 below. 

Equation 5 

N_ecc>T2*V 0 / ( ( 16*8*2048 ) * ( 1-V D /V r ) ) 

20 In Equation 5, T2 is the jump period described above, 

which has a maximum of 1.5s. V r , meanwhile, has a fixed 
value, which for the model in the upper part of Fig. 4 0 is 
11Mbps. V Q is expressed by the following Equation 6 that 
takes the variable bit rate of the AV block that includes a 

25 number N ecc of ECC blocks into consideration. Note that V 

o 

is not the maximum value of the logical transfer rate for 
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output from the track buffer, but is given by the equation 
below as the effective input rate of variable rate AV data 
into the decoder. AV block length here is given as the 
number N_j?ack of packs in an AV block composed of N_ecc ECC 
5 blocks ( (N_ecc-1) *16<N_pack<N_ecc*16) . 

Equation 6 

V C =AV block length (bit) * (1/AV block reproduction time (sec)) 
= (N_pack*2048*8) * (27M/ ( SCR_f irst__next - SCR_f irst_current) ) 

10 

In the above equation, SCR_f irst_next is the SCR of the 
first pack in the next AV block, while SCR__f irst_current is 
the SCR of the first pack in the present AV block. Each SCR 
shows the time at which the corresponding pack should be 

15 outputted from the track buffer to the decoder. The unit for 
SCRs is 1/27 megaseconds. 

As shown in the above Equations 5 and 6, the minimum 
size of an AV block can theoretically be calculated in 
accordance with the actual bit rate of the AV data. 

20 Equation 5 applies to a case where no defective sectors 

exist on the optical disc. When such sectors are present, 
the number of ECC blocks Necc required to ensure 
uninterrupted reproduction is as described below. 

It is presumed here that the AV block area includes ECC 

25 blocks with defective sectors, the number of which is 

represented as "dN_ecc" . No Av data is recorded into the 
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dN_ecc defective ECC blocks due to the ECC block skipping 
described above. The loss time Ts caused by skipping the 
dN_ecc defective ECC blocks is represented as "T4*dN ecc, " 
where "T4" represents the ECC block skip time for the model 
shown in Fig. 40. 

To ensure the uninterrupted reproduction of the AV data 
when defective sectors are included, the AV block area needs 
to include as the number of ECC blocks as represented by 
Equation 7 . 

Equation 7 

N_ecc > dN_ecc + V c * (Tj +Ts )/(( 16*8*2048 ) * ( 1-V c /V r ) ) 

As described above, the size of the AV block area is 
calculated from Formula 5 when no defective sector is 
present, and from Formula 7 when defective sectors are 
present . 

It should be noted here that when AV data is composed 
of a plurality of AV blocks, the first and last AV blocks do 
not need to not satisfy Equation 5 or 7 . This is because 
the timing at which decoding is commenced for the first AV 
block can be delayed, which is to say, the supply of data to 
the decoded may be delayed until sufficient data is 
accumulated in the buffer, thereby ensuring uninterrupted 
reproduction between the first and second AV blocks. The 
last AV block, meanwhile, is not followed by any particular 
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AV data, meaning that the reproduction may simply end with 
this last AV block. 

(3-2) Functional Blocks of the DVD Recorder 70 
5 Fig. 41 is a function block diagram showing the 

construction of the DVD recorder 70 divided into functions. 
Each function in Fig. 41 is realized by the CPU la in the 
control unit 1 executing a program in the ROM le to control 
the hardware shown in Fig. 17. 

10 The DVD player of Fig. 41 includes the disc recording 

unit 100, the disc reading unit 101, the common file system 
unit 10, the AV file system unit 11, the recording-editing- 
reproduction control unit 12, the AV data recording unit 13, 
the AV data reproduction unit 14, and the AV data editing 

15 unit 15. 

(3-2-1) Disc Reco rding Unit 100 - Disc Reading Unit 101 

The disc recording unit 100 operates as follows. On 
receiving an input of the logical sector number from which 

2 0 recording is to start and the data to be recorded from the 
common file system unit 10 and the AV file system unit 11, 
the disc recording unit 100 moves the optical pickup to the 
appropriate logical sector number and has the optical pickup 
record data in ECC block units (16 sectors) into the 

25 indicated sectors on the disc. When the amount of data to 

be recorded is below 16 sectors, the disc recording unit 100 
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first reads the data, subjects it to ECC processing, and 
records it onto the disc as an ECC block. 

The disc reading unit 101 operates as follows. On 
receiving an input of a logical sector number from which 
5 data is to be read and a number of sectors from the common 

file system unit 10 and the AV file system unit 11, the disc 
reading unit 101 moves the optical pickup to the appropriate 
logical sector number and has the optical pickup read data 
in ECC block units from the indicated logical sectors. The 
10 disc reading unit 101 has ECC processing performed on the 

read data and transfers only the required sector data to the 
common file system unit 10. As with the disc recording unit 
100, the disc reading unit 101 reads VOBs in units of 16 
sectors for each ECC block, thereby reducing the overheads. 



(3-2-2) Common File System Unit 10 

The common file system unit 10 provides the recording- 
editing-reproduction control unit 12, the AV data recording 

20 unit 13, the AV data reproduction unit 14, and the AV data 
editing unit 15 with the standard functions for accessing 
data format standardized under ISO/IEC 13346. These 
standard functions provided by the common file system unit 
10 control the disc recording unit 100 and the disc reading 

25 unit 101 to read or write data onto or from the DVD-RAM in 
directory units and file units. 
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Representative examples of the standard functions 
provided by the common file system unit 10 are as follows. 

1. Having the disc recording unit 100 record a file 
entry and output the file identification descriptor to the 

5 recording-editing-reproduction control unit 12, the AV data 
recording unit 13, the AV data reproduction unit 14, and the 
AV data editing unit 15. 

2. Converting a recorded area on the disc that includes 
one file into an empty area. 

10 3. Controlling the disc reading unit 101 to read the 

file identification descriptor of a specified file from a 
DVD-RAM. 

4. Controlling the disc recording unit 100 to record 
memory present in the memory onto the disc as a non-AV file. 
15 5. Controlling the disc reading unit 101 to read an 

extent that composes a file recorded on the disc. 

6. Controlling the disc reading unit 101 to move the 
optical pickup to a desired position in the extents that 
compose a file. 

20 To use any of the functions (1) to (6), the recording- 

editing-reproduction control unit 12 to AV data editing unit 
15 may issue a command to the common file system unit 10 to 
indicate the file to be read or recorded as a parameter. 
Such commands are called common file system-oriented 

25 commands. 

Various types of common file system-oriented commands 
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are available, such as "(1) CREATE", "(2) DELETE", 
" (3) OPEN/CLOSE", "(4) WRITE", "(5) READ", and "(6) SEEK". Such 
commands are respectively assigned to the functions (1) to 
(6) . 

5 In the present embodiment, the assignment of commands 

to the standard functions is as follows. To use function 
(1), the recording-editing-reproduction control unit 12 to 
AV data editing unit 15 may issue a "CREATE" command to the 
common file system unit 10. To use function (2), the 

10 recording-editing-reproduction control unit 12 to AV data 
editing unit 15 may issue a "DELETE" command to the common 
file system unit 10. In the same way, to respectively use 
functions (3), (4), (5), and (6), the recording-editing- 
reproduction control unit 12 to AV data editing unit 15 may 

15 issue an "OPEN/CLOSE", "WRITE", "READ" or "SEEK" command to 

the common file system unit 10. 

(3-2-3) AV File System Unit 11 

The AV file system unit 11 provides the AV data 
20 recording unit 13, AV data reproduction unit 14, and AV data 
editing unit 15 with extended functions which are only 
necessary when recording or editing an AV file. These 
extended functions cannot be provided by the common file 
system unit 10 . 

25 The following are representative examples of these 

extended functions . 
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(7) Writing a VOB that has been encoded by the MPEG 
encoder 2 onto a DVD-RAM as an AV file, 

(8) Cutting out an indicated part of the VOBs recorded 
in an AV file and setting the part as a different file. 

5 (9) Clearing an indicated part of the VOBs recorded in 

an AV file . 

(10) Linking two AV files that are present on the DVD- 
RAM with VOBUs that have been re-encoded according to the 
procedure in the first and second embodiments. 

10 To use the extended functions (7) to (10), the 

recording-editing-reproduction control unit 12 to AV data 
editing unit 15 may issue a command to the common file 
system unit 10 to indicate the file to be recorded, linked, 
or cut out. Such commands are called AV file system- 

15 oriented commands. Here, the AV file system-oriented 

commands "AV-WR1TE" , "SPLIT", "SHORTEN", and "MERGE" are 
available, with these being respectively assigned to the 
functions (7) to (10) . 

In the present embodiment, the assignment of commands 

2 0 to the extended functions is as follows. To use the 

function (7), the AV data recording unit 13 to AV data 
editing unit 15 may issue an AV-WRITE command. To use the 
function (8), the AV data recording unit 13 to AV data 
editing unit 15 may issue a SPLIT command. Similarly, to 

25 use the function (9) or (10), the AV data recording unit 13 
to AV data editing unit 15 may issue a "SHORTEN" or "MERGE" 
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command. With function (10), the extent of the file after 
linking is as long as or longer than an AV block. 

(3-2-4) Recording-Editina-Reproduction Control Unit 1? 
5 The recording-editing-reproduction control unit 12 

issues an OPEN/CLOSE command that indicates directory names 
as parameters to the common file system unit 10, and by 
doing so has the common file system unit 10 read a plurality 
of file identification descriptors from the DVD-RAM. The 

10 recording-editing-reproduction control unit 12 then analyzes 
the directory structure of the DVD-RAM from the file 
identification descriptors and receives a user indication of 
a file or directory to be operated upon. 

On receiving the user indication of the target file or 

15 directory, the recording-editing-reproduction control unit 
12 identifies the desired operation content based on the 
user operation notified by the remote control signal 
reception unit 8, and issues instructions to have the AV 
data recording unit 13, the AV data reproduction unit 14, 

20 and the AV data editing unit 15 perform the appropriate 
processing for the file or directory indicated as the 
operation target. 

To have the user indicate the operation target, the 
recording-editing-reproduction control unit 12 outputs 

25 graphics data, which visually represents the directory 

structure, the total number of AV files, and the data sizes 
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of empty areas on the present disc, to the video signal 
processing unit 5. The video signal processing unit 5 
converts this graphics data into an image signal and has it 
displayed on the TV monitor 72. 
5 Fig. 42 shows an example of the graphics data displayed 

on the TV monitor 72 under the control of the recording- 
editing-reproduction control unit 12. During the display of 
this graphics data, the display color of any of the files or 
directories may change to show potential operation targets. 

10 This change in color is used to focus the attention of the 

user, and so is called the "focus state". Display using the 
normal color, meanwhile, is called the "normal state". 

When the user presses the mark key on the remote 
controller 71, the display of the file or directory that is 

15 currently in the focus state returns to the normal state and 
a different, newly-indicated file or directory is displayed 
in the focus state. When any of the files or directories is 
in the focus state, the recording-editing-reproduction 
control unit 12 waits for the user to press the "confirm" 

20 key on the remote controller 71. 

When the user presses the enter key, the recording- 
editing-reproduction control unit 12 identifies the file or 
directory that is currently in the focus state as a 
potential operation target. In this way, the recording- 

25 editing-reproduction control unit 12 can identify the file 
or directory that is the operation target. 
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To identify the operation content, however, the 
recording-editing-reproduction control unit 12 determines 
what operation content has been assigned to the key code 
received from the remote control signal reception unit 8 . 
5 As shown on the left side of Fig. 41, keys with the legends 
"PLAY", "REWIND", "STOP", "FAST FORWARD", "RECORD", "MARK", 
"VIRTUAL EDIT", AND "REAL EDIT" are present on the remote 
controller 71. In this way, the recording-editing- 
reproduction control unit 12 identifies the operation 
10 content indicated by the user according to the key code 

received from the remote control signal reception unit 8 . 

(3-2-4-1) Operation Contents That C an Be Received by t-hP 
Recordina-Editina-Reproduction Control Unit 12 

15 The operation contents are classified into operation 

contents that are provided on conventional domestic AV 
equipment, and operation contents that are specially 
provided for video editing. As specific examples, "play", 
"rewind", "stop", "fast forward", and "record" all fall into 

20 the former category, while "mark", "virtual edit", and "real 
edit" all fall into the latter category. 

A "play" operation has the DVD recorder 70 play back a 
VOB that is recorded in an AV file that is specified as the 
operation target. 

25 A "rewind" operation has the DVD recorder 70 rapidly 

play back a presently reproduced VOB in reverse. 
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A "stop" operation has the DVD recorder 70 stop the 
reproduction of the present VOB. 

A "fast forward" operation has the DVD recorder 70 
rapidly play back the present VOB in the forward direction. 
5 A "record" operation has the DVD recorder 70 generate a 

new AV file in the directory indicated as the operation 
target and write the VOB to be recorded into the new AV 
file. 

These operations in this former category are well-known 

10 to users as functions of conventional domestic AV equipment, 
such as video cassette recorders and CD players. The 
operations in the latter category are performed by users 
when, to use an analogy of editing a conventional movie 
film, sections of movie film are cut out and spliced 

15 together to produce a new movie sequence. 

A "mark" operation has the DVD recorder 70 replay a VOB 
included in the AV file indicated as the operation target 
and marks desired images out of the video images replayed by 
the VOB. To use the analogy of editing a movie film, this 

2 0 "mark" operation involves the marking of points where the 
film is to be cut. 

A "virtual edit" operation has the DVD recorder 7 0 
select a plurality of pairs of two points indicated by a 
mark operation as reproduction start points and reproduction 

25 end points and then define a logical reproduction route by 
assigning a reproduction order to these pairs of points. 



164 



WO 99/14935 



PCT/JP98/04174 



In a virtual edit operation, the section defined by one 
pair of a reproduction start point and reproduction end 
point selected by the user is called a "cell". The 
reproduction route defined by assigning a reproduction order 
5 to the cells is called a "program chain". 

A real "edit" operation has the DVD recorder 70 cut out 
each section indicated as a cell from an AV file recorded on 
a DVD— RAM, set the cut-out sections as separate files, and 
link a plurality of cut-out sections in accordance with the 

10 reproduction order shown by a program chain. Such edit 

operations are analogous to the cutting of a movie film at 
the marked positions and the splicing of the cut sections 
together. In these edit operations, the extent of the 
linked files is equal to or greater than the length of one 

15 AV block. 

.The recording-editing-reproduction control unit 12 
controls which of the AV data recording unit 13 to the AV 
data editing unit 15 are used when performing the operation 
contents described above. In addition to specifying the 

20 operation target and operation content, the recording- 
editing-reproduction control unit 12 chooses the appropriate 
component (s) for the operation content out of the AV data 
recording unit 13 to AV data editing unit 15 and outputs 
instructions informing these components of the operation 

25 content. 

The following is a description of example instructions 
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that the recording-editing-reproduction control unit 12 
gives to the AV data recording unit 13, the AV data 
reproduction unit 14, and the AV data editing unit 15 using 
combinations of an operation target and an operation 
5 content. 

In Fig. 42, the directory "DVD_Video" is in the focus 
state, so that if the user presses the "RECORD" key, the 
recording-editing-reproduction control unit 12 identifies 
the directory "DVD_Video" as the operation target and 

10 "record" as the operation content. The recording-editing- 
reproduction control unit 12 selects the AV data recording 
unit 13 as the component capable of performing a record 
operation, and instructs the AV data recording unit 13 to 
generate a new AV file in the directory indicated as the 

15 operation target. 

When the file "AV_FILE#1" is in the focus state and the 
user presses the "PLAY" key on the remote controller 71, the 
recording-editing-reproduction control unit 12 identifies 
the file "AV_FILE#1" as the operation target and "play" as 

20 the operation content. The recording-editing-reproduction 
control unit 12 selects the AV data reproduction unit 14 as 
the component capable of performing a play operation, and 
instructs the AV data reproduction unit 14 to reproduce the 
AV file indicated as the operation target. 

25 When the file "AV_FILE#1" is in the focus state and the 

user presses the "MARK" key on the remote controller 71, the 
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recording-editing-reproduction control unit 12 identifies 
the file "AV_FILE#1" as the operation target and "mark" as 
the operation content. The recording-editing-reproduction 
control unit 12 selects the AV data editing unit 15 as the 
5 component capable of performing a mark operation, and 

instructs the AV data editing unit 15 to perform a marking 
operation for the AV file indicated as the operation target. 

(3-2-5 AV Data Recording Unit 13 

10 The AV data recording unit 13 controls encoding 

operations of the MPEG encoder 2 while issuing common file 
system-oriented commands and AV file system-oriented 
commands in a predetermined order to the common file system 
unit 10 and the AV file system unit 11. By doing so, the AV 

15 data recording unit 13 makes use of the functions (1) to 
(10) and realizes recording operations. 

(3-2-6) AV Data Reproduction Unit 14 

The AV data reproduction unit 14 controls decoding 

20 operations of the MPEG decoder 4, while issuing common file 
system-oriented commands and AV file system-oriented 
commands in a predetermined order to the common file system 
unit 10 and the AV file system unit 11. By doing so, the AV 
data reproduction unit 14 makes use of the functions (1) to 

25 (10) and realizes "play", "rewind", "fast forward", and 

"stop" operations . 
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(3-2-7) AV Data Editing Unit 15 

The AV data editing unit 15 controls the decoding 
operations of the MPEG decoder 4, while issuing common file 
system-oriented commands and AV file system-oriented 
5 commands in a predetermined order to the common file system 
unit 10 and the AV file system unit 11. By doing so, the AV 
data reproduction unit 14 makes use of the functions (1) to 
(10) and realizes "mark", "virtual edit", and "edit" 
operations . 

10 In more detail, on receiving instructions from the 

recording-editing-reproduction control unit 12 to mark the 
AV file indicated as the operation target, the AV data 
editing unit 15 has the AV data reproduction unit 14 
reproduce the indicated AV file and monitors when the user 

15 presses the "MARK key on the remote controller 71. When the 
user presses the "MARK" key during the reproduction, the AV 
data editing unit 15 writes information called a "mark 
point" onto the disc as a non-AV file. This mark point 
information shows the time in seconds from the start of the 

20 reproduction of the AV file to the point where the user 
pressed the "MARK" key. 

On receiving instructions from the recording-editing- 
reproduction control unit 12 for a virtual edit operation, 
the AV data editing unit 15 generates information that 

25 defines a logical reproduction route in accordance with the 
user key operations of the remote controller 71. The AV 
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data editing unit 15 then controls the common file system 
unit 10 so that this information is written onto the DVD-RAM 
as a non-AV file. 

On receiving instructions from the recording-editing- 
5 reproduction control unit 12 for a real edit operation, the 
AV data editing unit 15 cuts out the sections of the DVD-RAM 
indicated as cells and sets the cut-out sections as separate 
files which it links to form a sequence of cells. 

When linking a plurality of files, the AV data editing 

10 unit 15 performs processing so that seamless reproduction of 
images will be achieved. This means that there will be no 
interruptions in the image display when a linked AV file is 
reproduced. The AV data editing unit 15 links extents to 
make all extents, except for the last extent to be 

15 reproduced, equal to or greater than the AV block length. 

(3-2-7-1) Processing for Virtual Edits and Edits by the AV 
Data Editing Unit 15 

Fig. 43 is a flowchart for the processing of virtual 

20 edit and real edit operations. Figs. 44A to 44F are figures 
showing a supplementary example of the processing by the AV 
data editing unit 15 according to the flowchart of Fig. 43. 
The following describes the editing processes of the AV data 
editing unit 15 with reference to the flowchart of Fig. 43 

25 and the example in Figs. 44A to 44F. 

The AV file shown in Fig. 44A is already stored on the 



169 



WO 99/14935 



PCT/JP98/04174 



DVD-RAM. When this AV file is indicated as the operation 
target, the user presses the "PLAY" key on the remote 
controller 71. The recording-editing-reproduction control 
unit 12 detects key operations, so that when the user 
5 presses the "MARK" key, the AV data editing unit 15 has the 
AV data reproduction unit 14 commence the reproduction of 
the AV file in step SI. 

After the start of reproduction, the reproduction 
proceeds as far as the time tl in Fig. 44B when the user 

10 next presses the "MARK" key. In response to this, the AV 

data editing unit 15 sets the mark point#l, which expresses 
a relative time code for time tl, into the present AV file. 
The user subsequently presses the "MARK" key a total of 
seven times at times t2, t3, t4, ... t8 . In response, the 

15 AV data editing unit 15 sets the mark points #2, #3, #4, #5, 
... #8, which express relative time codes for time t2, t3, 
t4, ... t8, into the present AV file, as shown in Fig. 44B. 

After the execution of step SI, the processing proceeds 
to step S2 where the AV data editing unit 15 has the user 

20 indicate pairs of mark points. The AV data editing unit 15 
then determines the cells to be reproduced within the 
present AV file in accordance with the selected pairs of 
mark points. 

In Fig. 44C, the user indicates that mark points #1 and 
25 #2 form pair (1), mark points #3 and #4 form pair (2), mark 
points #5 and #6 form pair (3), and mark points #7 and #8 
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form pair (4) . 

In this way, the AV data editing unit 15 sets the AV 
data within each pair of points as a separate cell, and so 
in the present example sets the four cells, Cell#l, Cell#2, 
5 Cell#3, and Cell#4. Note that in the present example, the 
AV data editing unit 15 may alternatively set the pair of 
Mark#2 and Mark#3 as one cell, and the pair of Mark#4 and 
Mark#5 as another cell. 

Next, in step S3, the AV data editing unit 15 generates 

10 a program chain by assigning a reproduction order to the 

cells it has produced. In Fig. 44D, Cell#l is the first in 
the reproduction route (shown by the legend "1 st " in the 
drawing) , Cell#2 is the second in the reproduction route 
(shown by the legend "2 nd " in the drawing) , and Cells #3 and 

15 #4 respectively are the third and fourth in the reproduction 
route (shown by the legends "3 rd " and "4 th " in the drawing) . 
By doing so, the AV data editing unit 15 treats the 
plurality of cells as a program chain, based on the chosen 
reproduction order. Note that Fig. 44D shows the simplest 

20 reproduction order of cells, with the setting of other 

orders, such as Cell#3 -+ Cell#l -+ Cell#2 Cell#4, being 
equally possible. 

In step S6, the AV data editing unit 15 monitors 
whether the user has indicated the reproduction of the 

25 program chain. In step S5, the AV data editing unit 15 

monitors whether the user has indicated an editing operation 
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for the program chain. When the user indicates 
reproduction, the AV data editing unit 15 instructs the AV 
data reproduction unit 14 to reproduce the program chain 
indicated for reproduction. 
5 On receiving reproduction instructions from the AV data 

editing unit 15, the AV data reproduction unit 14 has the 
optical pickup seek Mark#l which is the reproduction start 
position for Cell#l, as shown in Fig. 44E. Once the optical 
pickup has moved to Mark#l in the AV file in accordance with 

10 the SEEK command, the AV data editing unit 15 has the 

section between Mark#l and Mark#2 read by issuing a READ 
command to the common file system unit 10. In this way, the 
VOBUs in Cell#l are read from the DVD-RAM, before being 
sequentially decoded by the MPEG decoder 4 and displayed as 

15 images on the TV monitor 72. 

Once the VOBUs have been decoded as far as Mark#2, the 
AV data editing unit 15 has the same processing performed 
for the remaining cells. By doing so, the AV data editing 
unit 15 has only the sections indicated as Cells #1, #2, #3, 

20 and #4 reproduced. 

The AV file shown in Fig. 44A is a movie that was 
broadcast on television. Fig. 44F shows the image content 
of the different sections in this AV file. The section 
between time tO and time tl is the credit sequence VI which 

25 shows the cast and director of the movie. The section 

between time tl and time t2 is the first broadcast sequence 
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V2 of the movie itself. The section between time t2 and 
time t3 is a commercial sequence V3 that was inserted into 
the TV broadcast. The section between time t3 and time t4 
is the second broadcast sequence V4 in the movie. The 
section between time t5 and time t6 is the third broadcast 
sequence V5 in the movie. 

Here, times tl, t2, t3, t4, t5, and t6 are set as 
Mark#l x Mark#2. Mark#3 N Mark#4 x Mark#5. and Mark#6, and pairs 
of marks are set as cells. The display order of cells is 
set as a program chain. 

When performing a read as shown in Fig. 44E, AV data 
editing unit 15 has the credit sequence VI skipped, so that 
the reproduction starts with the first movie sequence V2 
given between the time tl and the t2 . Following this, the 
AV data editing unit 15 has the commercial sequence V3 
skipped, and has the second movie sequence V4 between the 
time t3 and the t4 reproduced. 

The following is a description of the operation of the 
AV data editing unit 15 when the user indicates a real edit 
operation, with reference to Figs. 45A to 45E and Figs. 46A 
to 4 6F. Figs. 45A to 45E show a supplementary example of 
the processing of the AV data editing unit 15 in the 
flowchart of Fig. 43. The variables mx, Af in the flowchart 
of Fig. 43 and Figs. 45A to 45E indicate a position in the 
AV file. The following explanation deals with the 
processing of the AV data editing unit 15 for a real edit 
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operation. 

First, in step S8, the AV data editing unit 15 
determines at least two sections that are to be cut out from 
the present AV file in accordance with the program chain 
5 that was generated during a virtual edit operation. 

The "source AV file" in Fig. 45A has been given the 
mark points Mark#l, #2, #3, .... #8. The cells that have 
been set for this source AV file are defined by pairs of the 
mark points Mark#l, #2, #3, .... #8, so that the AV data 

10 editing unit 15 treats the mark points in each pair as an 

editing start point and an editing end point, respectively. 
As a result, the AV data editing unit 15 treats the pair of 
Marks #1 and #2 as the editing start point "In(l)" and the 
editing end point "Out(l)". The AV data editing unit 15 

15 similarly treats the pair of Marks #3 and #4 as the editing 
start point "In (2)" and the editing end point "Out (2)", the 
the pair of Marks #5 and #6 as the editing start point 
"In (3)" and the editing end point "Out (3)", and the pair of 
Marks #7 and #8 as the editing start point "In(4) fT and the 

20 editing end point "Out (4)". 

The period between Mark#l and Mark#2 corresponds to the 
first movie sequence V2 between the time tl and the time t2 
shown in Fig. 44F. Similarly, the period between Mark#3 and 
Mark#4 corresponds to the second movie sequence V4 between 

25 the time t3 and the time t4 shown in Fig. 44F, and the 

period between Mark#5 and Mark#6 corresponds to the second 
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movie sequence V5 between the time t5 and the time t6. 
Accordingly, by indicating this real edit operation, the 
user obtains an AV file that only includes the movie 
sequences V2, V4, and V5 . 
5 Next, in step S9, the AV data editing unit 15 issues a 

SPLIT command to the AV file system unit 11 to have the 
determined split region divided into mx AV files (where mx 
is an integer no less than 2) . The AV data editing unit 15 
treats each closed area indicated by a pair of an editing 
10 start point and an editing end point in Fig. 45A as an area 
to be cut out, and so cuts out the four AV files shown in 
Fig. 45B. 

The AV data editing unit 15 hereafter specifies one of 
the cut-out mx AV files using the variable Af, with the cut- 

15 out files being numbered AV file Afl, Af2, Af3, ... Afm. In 
step S10, the AV data editing unit 15 sets the variable Af 
at "1" to initialize the variable Af. In step Sll, the AV 
data editing unit 15 issues a READ command to the AV file 
system unit 11 for the VOBUs (hereinafter called the "end 

20 part") located at the end of the AV file Af and the VOBUs 
(hereinafter called the "first part") located at the start 
of the AV file Af +1 . After issuing these commands, in step 
S12 the AV data editing unit 15 uses the same procedure as 
the second embodiment to re-encode the last part of AV file 

25 Af and the first part of AV file Af. 

After the re-encoding, the AV data editing unit 15 
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issues a SHORTEN command to the AV file system unit 11 for 
the last part of the AV file Af and the first part of the AV 
file Af+1 (Af2) . 

In Fig. 45C, the last part of the AV file Afl and the 
5 first part of the AV file Af2 are read as a result of the 
READ command and are re-encoded. As a result of the re- 
encode process, the re-encoded data produced by re-encoding 
the read data is accumulated in the memory of the DVD 
recorder 70. In step S13, the AV data editing unit 15 

10 issues a SHORTEN command, which results in the area formerly 
occupied by the read last and first parts being deleted. 

It should be noted that the deletion performed in this 
way results in one of the two following cases. 

The first case is when regardless of whether either of 

15 the AV file Afl or the AV file Af+1, whose sections to be 

re-encoded have been deleted, has an continuous length that 
is equal to or greater than the AV block length, the 
continuous length of the other AV file is below the data 
size of an AV block. Since the length of an AV block is set 

2 0 at the length which prevents overflows occurring, if AV file 
Af or Af+1 is reproduced in a state where its continuous 
length is shorter than the length of an AV block, an 
underflow will occur in the track buffer. 

The second case is where the data size of the data (in- 

25 memory data) that has been re-encoded and stored in the 

memory is below the data size (length) of an AV block. When 
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the data size of the in-memory data is large and so would 
occupy a region on a DVD-RAM that is equal to or greater 
than one AV block, the data may be stored at a different 
position on the DVD-RAM away from the AV files Af and Af +1 . 
5 However, when the data size of the in-memory data is smaller 
than one AV block, the data cannot be stored at a different 
position on the DVD-RAM away from the AV files Af and Af +1 . 

This is for the following reasons. During a read 
performed for in-memory data that is smaller than the size 

10 of an AV block but is stored at a separate position, a 

sufficient amount of data cannot be accumulated in the track 
buffer. Should the jump from the in-memory data to the AV 
file Af+1 take a relatively long time, an underflow will 
occur in the track buffer while the jump is taking place. 

15 In Fig. 45D, the broken lines show that the last part 

of the AV file Afl and the first part of the AV file Af2 
have been deleted. This results in the length of the AV 
file Afl being below the length of an AV block, and in the 
length of the in-memory data being below the length of an AV 

20 block. 

If this AV file Afl is left as it is, there is the risk 
that an underflow will occur when jumping from the AV file 
Afl to the AV file Af 2 . To prevent the occurrence of such 
underflows, in step S14 the AV data editing unit 15 issues a 
2 5 MERGE command for the AV file Afl and the AV file Af+1. 

As shown in Fig. 45E and Fig. 4 6A, this processing 
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results in the linking of the AV file Afl and the re-encoded 
VOBUs so that the continuous length of the recording region 
for all the extents forming the AV file Afl ends up equal to 
or longer than the length of an AV block. After issuing the 
5 MERGE command, the AV data editing unit 15 judges in step 
S15 whether the variable Af matches the number of AV files 
mx-1. If the numbers do not match, the AV data editing unit 
15 increments the variable Af in step S16 and returns to 
step Sll. In this way, the AV data editing unit 15 repeats 

10 the processing in steps Sll to S14. 

After the variable Af has been incremented to become 
"2", the AV data editing unit 15 issues a READ command so 
that the last part of the AV file Af2 (after the previous 
linking) and the first part of the AV file Af3 are read, as 

15 shown in Fig. 4 6B. Once the VOBUs in this last part and 
first part have been re-encoded, the resulting re-encoded 
data is stored in the memory of the DVD recorder 70. 

The regions on the DVD-RAM that were originally 
occupied by the first part and the last part are deleted as 

2 0 a result of the SHORTEN command that the AV data editing 

unit 15 issued by the step S13. As a result, the remaining 
AV file Af3 has a continuous length that is below the length 
of an AV block. The AV data editing unit 15 issues a MERGE 
command to the AV file system unit 11 for the AV files Af2 

25 and Af3, as shown in Fig. 4 6D and 4 6E. This procedure is 
repeated until the variable Af is equal to the value mx-1. 
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As a result of the above processing, the extents in the 
storage area only contain the movie sequences V2, V4, and 
V5 . These extents each have a continuous length that is 
above the length of an AV block, so that it is guaranteed 
5 that there will be no interruptions to the image display 
during the reproduction of these AV files. 

The period between the Mark#l and the Mark#2 
corresponds to the first movie sequence V2 . The period 
between the Mark#3 and the Mark#4 corresponds to the first 
10 movie sequence V4, and the period between the Mark#5 and the 
Mark#6 corresponds to the third movie sequence V5 . As a 
result, by performing an edit operation, the user can obtain 
a sequence composed of AV files for only the movie sequences 
V2, V4, and V5 . 

15 

(3-2-7-1-2) Processing of the Av File System Unit 11 When a 

Split Command Is Issued 

The following explanation deals with the details of the 

processing by the AV file system unit 11 when providing 
20 extended functions in response to a SPLIT command. Fig. 48A 

shows the operation of the AV file system unit 11 when 

providing extended functions in response to a SPLIT command. 

In this flowchart, one out of the mx pairs of an editing 

start point (In point) and an editing end point (Out point) 
25 is indicated using the variable h. In step S22, the value 

"1" is substituted into the variable h so that the first 
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pair of In point and Out point are processed. 

The AV file system unit 11 generates a file entry (h) 
in step S31, and adds the file identifier (h) for the file 
entry (h) in a directory file of a temporary directory. 
5 In step S33, the AV file system unit 11 calculates the 

first address s of the sequence of u logical blocks (where 
u^l) from the logical block corresponding to the In point 
(h) to the logical block corresponding to the Out point (h) , 
and the number of occupied blocks r. 

10 In step S34, the AV file system unit 11 generates u 

allocation descriptors within the file entry (h) . In step 
S35, the AV file system unit 11 records the first address s 
of the sequence of u logical blocks and the number of 
occupied blocks r into the each of the u allocation 

15 descriptors. In step S35, the AV file system unit 11 judges 
whether the variable h has reached the value mx-1. 

If the variable h has not reached this value, the AV 
file system unit 11 increments the variable h and returns to 
step S31. By doing so, the AV file system unit 11 repeats 

20 the processing in steps S31 to S35 until variable h reaches 
the value mx-1, and so cuts out the closed sections within 
each of the mx-1 pairs of an In point and an Out point as AV 
files . 



25 (3-2-7-1-3) Processing of the A V File System Unit 11 Wh^n a 

Shorten Command Is Issued 
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The explanation deals with the processing of the AV 
file system unit 11 when providing extended functions in 
response to a SHORTEN command. Fig. 48 is a flowchart 
showing the content of this processing. 
5 In step S38, the AV file system unit 11 calculates both 

the first address c of the logical block sequence between 
the deletion start address and the deletion end address that 
specify the area to be deleted and the number of occupied 
blocks d. In step S45, the AV file system unit 11 accesses 

10 the allocation identifiers of the AV file whose first or 

last part is to be deleted. In step S46, the AV file system 
unit 11 judges whether the area to be deleted is the first 
part of the extent of an AV file. 

If the area to be deleted is the first part of an 

15 extent ("Yes" in step S46) , the AV file system unit 11 

advances to step S47 and updates the storage first address p 
of the extent to the storage first address p+c*d in the 
allocation descriptor . 

After this, in step S48 the AV file system unit 11 

20 updates the data size q of the extent of the number q of 
occupied blocks given in the allocation descriptor to the 
data size q-c*d. On the other hand, if in step S46 the AV 
file system unit 11 finds that the area to be deleted is the 
last part of an AV file, the AV file system unit 11 proceeds 

25 directly to step S48, and updates the data size q of the 
extent of the number q of occupied blocks given in the 
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allocation descriptor to the data size q-c*d. 

(3-2-7-1-4) Processing of the AV file syst em unit 11 when a 
MERGE command is Issued 
5 The following explanation deals with the processing 

content of the AV file system unit 11 when providing 
extended functions in response to a MERGE command. The 
following explanation aims to clarify the procedure used to 
process the areas surrounded by the dot-dash lines y3, y4 in 
10 Fig. 45E and Fig. 46D. 

In response to a MERGE command, the AV file system unit 
11 arranged the AV files Af and Af+1, which were partially 
deleted as a result of the SPLIT and SHORTEN commands, and 
the re-encoded data (in-memory data) , which is present in 
15 the memory of the DVD recorder 70 as a result of the re- 
encoding, onto the DVD-RAM in a way that enables the 
seamless reproduction of the AV file Af, the data in the 
memory, and the AV file Af+1 in that order. 

Fig. 47A shows an example of the AV data processed by 
20 the AV file system unit 11 when providing extended functions 
in response to a MERGE command. In Fig. 47A, the AV files x 
and y have been processed according to a SPLIT command. 

The virtual editing is assumed to have defined a 
reproduction route whereby the AV data is reproduced in the 
25 order AV file x ~> in-memory data -> AV file y. 

Fig. 47A shows an example reproduction route for the AV 
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data in the AV files x and y. In Fig. 47 A, the horizontal 
axis represents time, so that the reproduction route can be 
seen to set the display order as AV file x in-memory data 
AV file y. 

5 Of the AV data in AV file x, the data part m located at 

the end of AV file x is stored in a consecutive area of the 
DVD-RAM, with this being called the "former extent". 

Of the AV data in AV file y, the data part n located at 
the start of AV file y is also stored in a consecutive area 

10 of the DVD-RAM, with this being called the "latter extent". 

As a result of the "SPLIT" command, AV files x and y 
are obtained with certain sections of AV data having been 
cut away. However, while the file system manages the areas 
on the disc that correspond to the cut-away data as if they 

15 were empty, the data of the original AV file is in fact left 
as it is in the logical blocks on the DVD-RAM. 

It is assumed that when the reproduction route is set 
by the user, the user does not need to consider the way in 
which AV blocks on the DVD-RAM store the cut-away AV files. 

20 As a result, there is no way in which the positions on the 
DVD-RAM storing the former and latter extents can be 
identified for certain. Even if the reproduction route 
specifies the order as AV file x ~> AV file y, there is still 
the possibility of AV data that is unrelated to the present 

25 reproduction route being present on the disc between the 
former and the latter extent. 
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In view of the above consideration, the linking of AV 
files cut away by a SPLIT command does not assume that the 
former extent and latter extent are recorded at consecutive 
positions on the DVD-RAM, and so should instead assume that 
the former extent and latter extent are recorded at 
completely unrelated positions on the DVD-RAM. 

Here, it should be assumed that at least one "other 
file extent", which is unrelated to the reproduction route 
indicating the AV files x and y, is present between the 
storage regions of the former extent and the latter extent. 

Fig. 47B shows a representation of the positional 
relationship of the storage areas on the DVD-RAM of the 
former extent and the latter extent, in view of the above 
consideration . 

The AV file x including the former extent is partially 
cut away as a result of the SPLIT command, and so includes 
an empty area where the former extent was formerly present. 
This area is called the Out area. As described above, this 
Out area in fact still logically includes the data of the AV 
file x that was cut out, although the AV file system unit 11 
treats the area as an empty area since the SPLIT command has 
already been issued. 

The AV file y including the latter extent is partially 
cut away as a result of the SPLIT command, and so includes 
an empty area where the latter extent was formerly present. 
This area is called the In area. As described above, this 
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In area in fact still logically includes the data of the AV 
file y that was cut out, although the AV file system unit 11 
treats the area as an empty area since the SPLIT command has 
already been issued. 
5 In Fig. 47B, the former extent is stored at a preceding 

position to the latter extent, though this merely 
illustrates one example, so that it is perfectly possible 
for the latter extent to be stored at a preceding position 
to the former extent. 

10 In the present example, the other file extent is 

present between the former extent and the latter extent. 
While the In area and the Out area are ideal for the 
recording of the in-memory data, the continuous length of 
the In area and the Out area is restricted due to the 

15 presence of the other file extent between the former extent 
and the latter extent. 

In step S62 in the flowchart of Fig. 49, the AV file 
system unit 11 calculates the data size of the Out area, and 
the data size of the In area. 

20 On finding the data size of the In area and the Out 

area, the AV file system unit 11 refers to the data size m 
of the former extent and the data size n of the latter 
extent and judges whether the former extent could cause an 
underflow in the track buffer during reproduction. 

25 

(3-2-7-1-4-1) Processing When the Former Extent m is Ipas 
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than the AV Bloc k Length 

When the former extent m is shorter than the AV block 
length and the latter extent n is at least equal to the AV 
block length, an underflow may occur for the former extent 
5 m. The processing proceeds to step S70 in Fig. 50. 

Fig. 50 is a flowchart when the former extent m is 
shorter than the AV block length and the latter extent n is 
at least equal to the AV block length. The processing by 
the AV file system unit 11 in Fig. 50 is explained with 

10 reference to Figs. 51, 52, and 53. Figs. 51, 52, and 53 

show the relationships among the data sizes of the extents m 
and n, the In area and the Out area i and j, the in-memory 
data k, and the AV block B, as well as the areas in which 
each piece of the data is recorded and the areas to which 

15 the data is moved. 

The former extent is shorter than the AV block length. 
As a result, an underflow would occur if no remedial action 
were taken. Accordingly, the flowchart in Fig. 50 shows the 
processing to determine the appropriate storage location for 

20 the former extent and the in-memory data. 

In step S70, it is judged whether the sum of the sizes 
of the former extent and the in-memory data is equal to or 
greater than the AV block length. If so, the processing 
proceeds to step S71, and it is judged whether the Out area 

25 are is larger than the in-memory data. When the Out area is 
larger than the in-memory data, the in-memory data is 
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written in the Out area to make the consecutive length of 
the former extent at least equal to the AV block length. 
Fig. 51A shows an arrangement of the former extent, the 
latter extent, the In area, and the Out area on the DVD-RAM 
5 in a relationship i^k, m+k>B. In Fig. 51B, when the in- 
memory data is recorded in the Out area, the consecutive 
length of the former extent becomes at least equal to the AV 
block length. 

On the other hand, when the Out area is smaller than 
10 the in-memory data, data is moved. Fig. 52A shows an 

arrangement of the former extent, the latter extent, the In 
area, and the Out area on the DVD-RAM in a relationship i<k, 
m+k>B. 

In Fig. 52A, the former extent is first read into the 
15 memory, and in Fig. 52B the former extent is written in an 

empty area in the same zone as the former extent. After the 
first extent has been moved, the in-memory data is written 
immediately after the moved former extent, as shown in Fig. 
52C. 

20 When the sum of the sizes of the former extent and the 

in-memory data is less than the AV block length, the 
processing proceeds to step S72. In step S72, it is judged 
whether the sum of the sizes of the former extent, the 
latter extent, and the in-memory data is at least equal to 

25 two AV block lengths is judged. When the sum of the sizes 

is less than the AV block length, even if data is moved, the 
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size is less than the AV block length. As a result, an 
underflow occurs. When the sum of the sizes is less than 
two AV block lengths, even if the former extent, the in- 
memory data, and the latter extent are written in a logical 
5 block, the recording time will not be too long. In the 

flowchart in Fig. 50, when the sum of the sizes of the in- 
memory data, the former extent, and the latter extent is 
less than two AV blocks, the processing proceeds from step 
S72 to step S73, and the former extent and the latter extent 
10 are moved. 

Fig. 53A shows an arrangement of the former extent, the 
latter extent, the In area, and the Out area on the DVD-RAM 
in a relationship i<k, m+k<B, B^m+n+k<2B. In this case, 
a search is performed for an empty area in the same zone as 
15 the former extent and the latter extent. When an empty area 
is found, the former extent is read into the memory and is 
written in the empty area to move the former extent to the 
empty area, as shown in Fig. 53B. After the move, the in- 
memory data is written just after the moved former extent, 
20 as shown in Fig. 53C. After the in-memory data has been 
written, the latter extent is read into the memory and is 
written immediately after the occupied area of the in-memory 
data to move the latter extent to the empty area, as shown 
in Fig. 53D. 

25 When the sum of the sizes of the in-memory data, the 

former extent, and the latter extent is at least equal to 
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two AV block lengths, the processing proceeds from step S72 
to step S74. When the sum of the sizes is equal to or 
greater than two AV block lengths, it will take a long time 
to write the data into the logical block. Meanwhile, a 
5 simple method in which the former extent is moved and the 

in-memory data is written just after the moved former extent 
should not be adopted in view of the access speed. Here, it 
should be especially noted that the processing proceeds from 
step S72 to step S74 because the sum of the sizes of the in- 

10 memory data and the former extent is less than the AV block 
length. The reason why the sum of the sizes of the in- 
memory data and the former extent is less than the AV block 
length yet the sum of the sizes of the in-memory data, the 
former extent, and the latter extent is at least equal to 

15 two AV block lengths is that the latter extent size is 

relatively large, with the difference between the latter 
extent size and the AV block length is being large. As a 
result, when the sum of the sizes of the former extent and 
the in-memory data is less than the AV block length, part of 

20 the data in the latter extent may be added to the sum, with 
there being no risk of the remaining data size of the latter 
extent being insufficient. 

When the sum of the sizes of the in-memory data, the 
former extent, and the latter extent is at least equal to 

25 two AV block lengths, the processing proceeds from step S72 
to step S74, and the data are linked in the manner shown in 
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Figs. 54A to 54D. 

Fig. 54A shows an arrangement of the former extent, the 
latter extent, the In area, and the Out area on the DVD-RAM 
in a relationship m+k<B, m+n+k;>2B. In this case, a search 
5 is performed for an empty area in the same zone as the 

former extent and the latter extent. When such an empty 
area is found, the former extent is read into the memory and 
is then written in the empty area to move the former extent, 
as shown in Fig. 5 4B. Next, the in-memory data is written 

10 immediately after the moved former extent, as shown in Fig. 
54C. When the in-memory data has been written, a set of 
data that is sufficiently large to make the size of the data 
in this empty area equal to AV block size is moved from the 
start of the latter extent just after the in-memory data as 

15 shown in Fig. 54D. 

After the former extent, the in-memory data, and front 
part of the latter extent are linked in the above-described 
procedure, the file entries of the AV file Af that includes 
the former extent and the AV file Af+1 are integrated. One 

20 integrated file entry is obtained, and the processing ends. 

(3-2 -7-1-4-2) Processing When the Latter Extent n 

is shorter than the AV Bloc k Length 
When the judgment "No" is given in step S63 in the 
25 flowchart of Fig. 49, the processing proceeds to step S64 
where it is judged whether the former extent m is at least 
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equal to the AV block length but the latter extent n is 
shorter than the AV block length. In other words, in step 
S63, it is judged whether an underflow may occur for the 
latter extent. 

5 Fig. 55 is a flowchart when the latter extent is 

shorter than the AV block length and the former extent is at 
least equal to the AV block length. The processing by the 
AV file system unit 11 in the flowchart in Fig. 55 is 
explained with reference to Figs. 56, 57, 58 and 59. Figs. 

10 56, 57, 58 and 59 show the relationships among the data 

sizes of the extents m and n, the In area and the Out area i 
and j, the in-memory data k, and the AV block B, as well as 
the areas in which each piece of the data is recorded and 
the areas to which the data is moved. 

15 In step S75, it is judged whether the sum of the sizes 

of the latter extent and the in-memory is at least equal to 
the AV block length. If so, the processing proceeds from 
step S75 to step S76, where it is judged whether the In area 
is larger than the in-memory data. Fig. 56A shows an 

20 arrangement of the former extent, the latter extent, the In 
area, and the Out area on the DVD-RAM in a relationship j^k, 
n+k>B. In Fig. 56B, the recording of the in-memory data in 
the In area results in the consecutive length of the latter 
extent becoming at least equal to the AV block length. 

25 On the other hand, when the In area is smaller than the 

in-memory data, data is moved. Fig. 57A shows an 



191 



WO 99/14935 



PCT/JP98/04174 



arrangement of the former extent, the latter extent, the In 
area, and the Out area on the DVD-RAM in a relationship j<k, 
n+k^B. In this case, a search is performed for an empty 
area in the same zone as the former extent and the latter 
5 extent. When such an empty area is found, the in-memory 

data is written in the empty area as shown in Fig. 57B. The 
latter extent is then read into the memory and is written 
immediately after the occupied area of the in-memory data, 
as shown in Fig. 57C. 

10 When the sum of the sizes of the latter extent and the 

in-memory data is less than the AV block length, the 
processing proceeds from step S75 to step S77. In step S77, 
it is judged whether the sum of the sizes of the former 
extent, the latter extent, and the in-memory data is at 

15 least equal to two AV block lengths. 

When the sum of the sizes is less than two AV block 
lengths, the processing proceeds to step S78. Fig. 58A 
shows an arrangement of the former extent, the latter 
extent, the In area, and the Out area on the DVD-RAM in a 

20 relationship j<k, n+k<B, m+n+k<2B. In step S78, the AV file 
system unit 11 searches for an empty area in the same zone 
as the former extent and the latter extent. When such an 
empty area is found, the former extent is read into the 
memory and is written into the empty area to move the former 

25 extent to the empty area, as shown in Fig. 58B. Next, the 

in-memory data is written immediately after the moved former 
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extent, as shown in Fig. 58C. When the in-memory data has 
been written, the latter extent is read into the memory and 
is written immediately after the area occupied by the in- 
memory data to move the latter extent to the empty area, as 
5 shown in Fig. 58D. 

When the sum of the sizes of the in-memory data, the 
former extent, and the latter extent is at least equal to 
two AV block lengths, the processing proceeds from step S77 
to step S7 9, and the data are linked in the manner shown in 

10 Figs. 59A to 59D. 

Fig. 5 9A shows an arrangement of the former extent, the 
latter extent, the In area, and the Out area on the DVD-RAM 
in a relationship n+k<B, m+n+k^2B. In this case, a search 
is performed for an empty area in the same zone as the 

15 former extent and the latter extent. When such an empty 
area is found, data with a data size of which is (the AV 
block length - (n+k) ) is moved from the end of the former 
extent to the empty area, as shown in Fig. 59B. As shown in 
Fig. 59C, the in-memory data is written immediately after 

20 this data moved from the former extent. When the in-memory 
data has been written, the latter extent is moved to 
immediately after the occupied area of the in-memory data, 
as shown in Fig. 5 9D. 

When the judgement "No" is given in step S64 in the 

25 flowchart in Fig. 49, the processing proceeds to step S65, 
where it is judged whether the both the former extent m and 
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the latter extent n are shorter than the AV block length is 
judged. In other words, it is judged whether an underflow 
may occur for both the former extent m and the latter extent 
n . 

5 Fig. 60 is a flowchart for when both the former extent 

and the latter extent are shorter than the AV block length. 
The processing by the AV file system unit 11 in the 
flowchart in Fig. 60 is explained with reference to Figs. 
61, 62, 63 and 64. Figs. 61, 62, 63 and 64 show the 
10 relationships among the data sizes of the extents m and n, 
the In area and the Out area i and j, the in-memory data k, 
and the AV block B, as well as the areas in which each piece 
of the data is recorded and the areas to which the data is 
moved . 

15 In step S80 in this flowchart, it is judged whether the 

sum of the sizes of the in-memory data, the former extent, 
and the latter extent is at least equal to AV block length. 
If not, the processing proceeds to step S81. In this case, 
the sum of the sizes of the former extent, the in-memory 

20 data, and the latter extent is shorter than the AV block 
length. As a result, it is judged whether there is an 
extent which follows the latter extent. When no extent 
follows the latter extent, the latter extent is at the end 
of the AV file that is created by the linking of data, so 

25 that no additional processing is needed. When an extent 

follows the latter extent, an underflow may occur since the 
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sum of the sizes of the former extent, the in-memory data, 
and the latter extent is less than the AV block length. In 
order to avoid such underflow, when the extent following the 
latter extent is linked to the latter extent by the link 
5 processing shown in Figs. 61A-61D. Fig. 61A shows an 

arrangement of the former extent, the latter extent, the In 
area, and the Out area on the DVD-RAM in a relationship 
m+n+k<B. In step S81, the AV file system unit 11 writes the 
in-memory data in the In area, as shown in Fig. 61B. When 

10 the in-memory data has been written in the In area, the AV 
file system unit 11 reads the latter extent into the memory 
and writes the read latter extent immediately after the area 
occupied by the in-memory data to move the latter extent to 
the empty area, as shown in Fig. 61C. 

15 Then, as shown in Fig. 61D, the AV file system unit 11 

takes data whose size is (the AV block length - (the former 
extent + the in-memory data + the latter extent) ) from the 
extent following the latter extent. The AV file system unit 
11 links this data with the former extent, the in-memory 

20 data, and the latter extent. 

When the sum of the sizes of the former extent, the 
latter extent, and the in-memory data is at least equal to 
the AV block length, the processing proceeds to step S82. 
In step S82, the AV file system unit 11 judges whether the 

25 data size of the Out area following the former extent is 

less than the sum of the sizes of the latter extent and the 
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in-memory data. If not, the processing proceeds to step 
S83. Fig. 62A shows an arrangement of the former extent, 
the latter extent, the In area, and the Out area on the DVD- 
RAM in a relationship i;>n+k, m+n+k;>B. In step S83, the AV 
5 file system unit 11 writes the in-memory data into the In 
area, as shown in Fig. 62B. After writing the in-memory 
data, the AV file system unit 11 reads the latter extent 
into the memory and writes the latter extent immediately 
after the occupied area of the in-memory data to move the 

10 latter extent. 

When the data size of the Out area following the former 
extent is less than the sum of the sizes of the latter 
extent and the in-memory data, the processing proceeds from 
step S82 to step S84. In step S84, it is judged whether the 

15 data size of the In area preceding the latter extent is less 
than the sum of the sizes of the former extent and the in- 
memory data. If not, the processing proceeds to step S85. 
Fig. 63A shows an arrangement of the former extent, the 
latter extent, the In area, and the Out area on the DVD-RAM 

20 in a relationship i<n+k, m+n+k;>B. In step S85, the AV file 
system unit 11 writes the in-memory data in the In area as 
shown in Fig. 63B. After writing the in-memory data, the AV 
file system unit 11 reads the former extent into the memory 
and writes the former extent into a storage area immediately 

25 before the occupied area of the in-memory data to move the 
former extent to the In area, as shown in Fig. 63C. 
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When the judgement "No" is given in step S84, the 
processing proceeds to step S86. Fig. 64A shows an 
arrangement of the former extent, the latter extent, the In 
area, and the Out area on the DVD-RAM in a relationship 
5 i<n+k, j<m+k, m+n+k^B. In step S86, it is judged whether 
the sum of the sizes of the former extent, the latter 
extent, and the in-memory data is more than two AV block 
lengths. If not, the AV file system unit 11 searches for an 
empty area in the same zone as the former extent. When an 

10 empty area is found, the AV file system unit 11 reads the 
former extent into the memory and writes the read former 
extent into the empty area to move the former extent to the 
empty area, as shown in Fig. 64B. After the move, the AV 
file system unit 11 writes the in-memory data into a storage 

15 area immediately after the moved former extent, as shown in 
Fig. 64C. After writing the in-memory data, the AV file 
system unit 11 reads the latter extent into the memory and 
writes the latter extent into a storage area just after the 
occupied area of the in-memory to move the latter extent to 

20 the empty area, as shown in Fig. 64D. 

When the combined size of the former extent, the latter 
extent, and the in-memory data exceeds AV blocks, it is 
judged whether either the Out area or the In area is large. 
When the Out area is large, a part of the in-memory data is 

25 recorded in the Out area to make the continuous length equal 
to AV block length. The remaining part of the in-memory 
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data is recorded in a different empty area, and the latter 
extent is moved to a position directly after this remaining 
part of the in-memory data. 

When the In area is large, the AV file system unit 11 
5 moves the former extent to an empty area and records a first 
part of the in-memory data to make the continuous length 
equal to AV block length. After this, the remaining part of 
the in-memory data is recorded in the In area. 

As a result of the above processing for moving extents, 
10 the total consecutive length can be kept equal to or below 2 
AV block lengths. 

After the former extent, the in-memory data, and front 
part of the latter extent are linked in the above-described 
procedure, the file entries of the AV file Af that includes 
15 the former extent and the AV file Af+1 are integrated. One 
integrated file entry is obtained, and the processing ends. 

(3-2-7-1-4-3) Processing When Both the Former Extent and the 
Latter Extent are at Least Equal to the AV Blo ck Length 

20 When the judgement "No" is given in step S65 in the 

flowchart of Fig. 49, the processing proceeds to step S66 
where it is judged whether the in-memory data is at least 
equal to the AV block length. If so, the in-memory data is 
recorded in an empty area and the processing ends. 

25 When the judgment "No" is given in step S66 in the 

flowchart of Fig. 49, the AV file system unit 11 judges 

198 



WO 99/14935 



PCT/JP98/04174 



whether the former extent m is at least equal to the AV 
block length, the latter extent n is at least equal to the 
AV block length, but the in-memory data is smaller than the 
combined size of the In area i and the Out area j. Fig. 65 
5 is a flowchart when the latter extent is at least equal to 
the AV block length. 

Figs. 66A-66D show an supplementary example showing the 
processing of the AV file system unit 11 in Fig. 65. In 
Fig. 66A, the former extent and latter extent are both at 
10 least equal to the AV block length. Figs. 66B-66D show how 
the in-memory data and extents are recorded in the In area, 
Out area, and other empty areas as a result of the steps in 
Fig. 65. 

In this case, there is no risk of an underflow 
15 occurring for either the former or the latter extent. It 
would be ideal, however, if the in-memory data could be 
recorded in at least one of the Out area following the AV 
file Af and the In area preceding the AF file Af+1 without 
having to move the former or latter extent. 
20 In step S87 of the flowchart in Fig. 65, it is judged 

whether the size of the Out area exceeds the data size of 
the in-memory data. If so, the in-memory data is simply 
recorded into the Out area in step S88, as shown in Fig. 
66B. 

25 If the size of the Out area is below the data size of 

the in-memory data, the processing proceeds to step S8 9, 
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where it is judged whether the size of the In area exceeds 
the data size of the in-memory data. If so, the in-memory 
data is simply recorded into the In area in step S90, as 
shown in Fig. 66C. If the in-memory data cannot be recorded 
into either the Out area or the In area, the processing 
proceeds to step S91 where the in-memory data is divided 
into two parts that are respectively recorded in the Out 
area and In area, as shown in Fig. 66D. 

After the former extent, the in-memory data, and front 
part of the latter extent are linked in the above-described 
procedure, the file entries of the AV file Af that includes 
the former extent and the AV file Af+1 are integrated. One 
integrated file entry is obtained, and the processing ends. 

(3-2-7-1-4-4) Processing Whe n Both the F or m er Extent and the 
Latter Extent are at Least Equal to the AV Block Length 

In step S69 in the flowchart of Fig. 49, it is judged 
whether the former extent m is at least equal to the AV 
block length and the latter extent n is at least equal to 
the AV block length, but the size of the in-memory data k 
exceeds the combined size of the Out area j and the In area 
i . 

Fig. 67 is a flowchart showing the processing when both 
the former extent but the combined size of the In area and 
the Out area is below the data size of the in-memory data. 
Figs. 68A-68E show supplementary examples foe the processing 
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of the AV file system unit 11 in the flowchart of Fig. 67. 
In Fig. 68A, both the former extent and the latter extent 
are at least equal to AV block length. Figs. 68B-68D show 
how the extents and in-memory data are recorded in the In 
area, Out area, and other empty areas as a result of the 
steps in Fig. 67. 

In this case, both the former extent and the 
latter extent are at least equal to AV block length, so that 
there is no risk of an underflow occurring, although the 
recording area of the in-memory data must have a continuous 
length that is at least equal to AV block length. 

In step S92, it is judged whether the total size of the 
former extent and the in-memory data is at least equal to 
two AV block lengths . 

If the total size exceeds two AV block lengths, the 
processing advances to step S93 where data whose size is (AV 
block length-data size of in-memory data k) is read from the 
end of the former extent and moved to an empty area where 
the in-memory data is also recorded. This results in the 
recording state of this empty area and both extents being 
equal to AV block length, as shown in Fig. 68B. 

If the judgement "No" is given in step S92, the 
processing advances to step S94, where it is judged whether 
the total size of the latter extent and the in-memory data 
is at least equal to two AV block lengths. If so, the 
processing follows the pattern in step S92, since an 
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excessively long logical block write operation is to be 
avoided and since a relatively large amount of data can be 
moved from the latter extent without any risk of the latter 
extent ending up shorter than AV block length. 
5 If the total size of the latter extent and the in- 

memory data is at least equal to two AV block lengths, the 
processing advances to step S95, where data whose size is 
(AV block length-data size of in-memory data k) is read from 
the start of the latter extent and moved to an empty area in 

10 the same zone as the former and latter extents, where the 
in-memory data is then also recorded. This results in the 
recording state of this empty area and both extents being 
equal to AV block length, as shown in Fig. 68C. 

If the total size of the former extent and the in- 

15 memory data is below two AV block lengths, and the total 
size of the latter extent and the in-memory data is below 
two AV block lengths, the total data amount to be written 
into logical blocks will be less than two AV block lengths, 
so that the move processing can be performed without concern 

20 for the time taken by the write processing involved. 

Accordingly, when the total size of the former extent 
and the in-memory data is below two AV block lengths, and 
the total size of the latter extent and the in-memory data 
is below two AV block lengths, the processing advances to 

25 step S96, where the larger of the former extent and the 
latter extent is found. In this situation, either the 
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former or the latter extent may be moved, although in the 
present embodiment it is ideal for the smaller of the two to 
be moved, hence this judgement in step S96. When the former 
extent is the smaller of the two, in step S97 the former 
5 extent is moved, with the in-memory data then being recorded 
at a position immediately after the in-memory data. When 
doing so, the continuous length of the data recorded in this 
empty area will be below two AV block lengths, as shown in 
Fig. 68D. 

10 When the latter extent is the smaller of the two, in 

step S98 the latter extent is moved, with the in-memory data 
then being recorded at a position immediately before the in- 
memory data. When doing so, the continuous length of the 
data recorded in this empty area will be below two AV block 

15 lengths, as shown in Fig. 68E. 

After the former extent, the in-memory data, and front 
part of the latter extent are linked in the above-described 
procedure, the file entries of the AV file Af that includes 
the former extent and the AV file Af+1 are integrated. One 

20 integrated file entry is obtained, and the processing ends. 

Flowcharts for "MERGE" processing in a variety of 
circumstances have been explained, with it being possible to 
limit the data size of the moved and recorded data to two AV 
block lengths in the worst case scenario. However, this 

25 does not mean that there are no cases where data that 

exceeds two AV blocks lengths needs to be written, with the 
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following two cases describing such exceptions where data 
that exceeds two AV blocks lengths needs to be written. 

In the first exception, an empty area with a continuous 
length of two AV block lengths is required, although only 
5 separate empty areas of AV block length are available. In 
this case to create an empty area with a continuous length 
of two AV block lengths, AV data for one AV block length 
must be moved. 

In the second exception, in step S81 of Fig. 60, the 

10 moving of data from the latter extent results in the 

remaining part of the latter extent becoming below AV block 
length. In this case, a further move operation becomes 
necessary, with the total amount of moved data in the entire 
processing exceeding two AV block lengths. 

15 While the above explanation only deals with the linking 

of two AV files and in-memory data, a MERGE command may be 
executed for only one AV file and in-memory data. This case 
is the same as when adding data to the final extent in an AV 
file, so that the total size after such addition needs to be 

20 at least equal to AV block size. As a result, the in-memory 
data is recorded into the Out area following this final 
extent. When the Out area is too small to record all the 
in-memory data, the remaining part of the in-memory data may 
be recorded in a separate empty AV block. 

25 The above linking process has been explained for the 

premise of seamless reproduction within a file, although it 
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may also be used for seamless reproduction across files. 
Seamless reproduction across files refers to a branch in 
reproduction from a present AV file to another AV file. In 
the same way as described above, when linking two AV files 
and in-memory data, the continuous length of each extent 
must be at least equal to AV block length, so that a 
thorough link procedure must be used. 

This completes the explanation of the linking procedure 
used by the AV file system unit 11. 

(3-2-7-1-5 ) Updatin g of the VQB Information 

and PGC Information 

The following is an explanation of the updating of the 
VOB information (time map table, seamless linking 
information) and PGC information (cell information) when 
executing a SPLIT command or MERGE command. 

First, the processing when a SPLIT command has been 
executed will be explained. Out of the plurality of AV 
files that are obtained by the execution of the SPLIT 
command, one AV file is assigned the same AV_File_ID as the 
AV file which recorded the VOB from which it was split. The 
AV_File_IDs of the other AV files split from the AV file 
however need to be assigned new values. 

VOBs that were originally recorded as an AV file will 
lose several sections due the execution of a SPLIT command, 
so that the marks that indicated the lost sections need to 
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be deleted. In the same way, the cell information that gave 
these marks as the start points and end points need to be 
deleted from the RTRW management file. 

In addition to deleting the mark points, it is 
5 necessary to generate new cell information that indicates 
the video presentation start frame of the AV file as 
C_V_S_PTM and the video presentation end frame of the AV 
file as C__V__E__PTM, and to add this new cell information to 
the RTRW management file. 

10 The VOB information that includes the seamless linking 

information and time map table is divided into a plurality 
of parts when the corresponding VOB is divided. In more 
detail, if mx VOBs are produced by the division, the VOB 
information is divided to give mx time map tables and mx 

15 sets of seamless linking information. 

The video presentation start time VOB_V_S_PTM and the 
video presentation end time VOB_V_E_PTM of a VOB generated 
by the processing that accompanies the execution of the 
SPLIT command are respectively set based on the C_V_S_PTM, 

20 C_V_E_PTM indicated by the start point and end point in the 
cell information used for the SPLIT command. The LAST_SCR 
and FIRST_SCR in the seamless linking information are also 
updated. 

The following is a description of how the information 
25 is updated when a MERGE command has been executed. The 

execution of a MERGE command results in one AV file being 
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produced from a plurality of AV files, so that the VOBs that 
are included in this plurality of AV files will be composed 
of sets of frame data that are not interrelated, meaning 
that the time stamps across these AV files will not be 
5 continuous. Since these are managed as a VOB that differs 
from the plurality of VOBs that were originally included in 
different AV files, separate VOB_IDs will be assigned to 
these VOBs. 

The other necessary processing is as described in the 
10 second embodiment. However, the C_V_E_PTM in the cell 
information that specifies a split area needs to be 
increased by the number of frames included in the part of 
the former VOBU that have been encoded. Similarly, the 
C_V_S_PTM in the cell information that specifies a split 
15 area in a latter AV file needs to be decreased by the number 
of frames included in the part of the latter VOBU that have 
been encoded. 
(3-2-3) 

The def ragmentation unit 16 is connected to a fixed 
20 magnetic disc apparatus. This def ragmentation unit 16 reads 
an extent, out of the extents recorded on the DVD-RAM that 
have been subjected to link processing or other processing, 
that has an empty area on either side of its recording area 
and writes this extent onto the fixed magnetic disc 
25 apparatus to generate backup data in the fixed magnetic disc 
apparatus. After writing all of such extent onto the fixed 
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magnetic disc apparatus, the def ragmentation unit 16 reads 
the generated backup data and writes the backup data for the 
backed-up extent into the empty area adjacent to the extent. 
Here, extents which have an empty area adjacent to 
5 their recording area are extents that have been generated by 
the AV file system unit 11 executing a "SPLIT" command or a 
"SHORTEN" command. These empty areas equate to areas have 
been cleared and not since used as the recording area of the 
in-memory data or the moved-to area for an extent when a 

10 MERGE command has been performed. 

Figs. 69A-69D show an example that illustrates the 
operation of the def ragmentation unit 16. In Fig. 69A, 
extent #x is shown as an extent with empty areas i, j on 
both sides of its recording area. As shown in Fig. 69A, the 

15 def ragmentation unit 16 detects this extent, reads it from 
the DVD recorder 70, and writes it onto the fixed magnetic 
disk apparatus. 

As a result of this write operation, backup data is 
generated in the fixed magnetic disk apparatus, as shown in 

20 Fig. 69B. After this, the def ragmentation unit 16 reads the 
backup data from the fixed magnetic disk apparatus, as shown 
in Fig. 69C, and writes the extent onto the DVD-RAM to use 
both the current recording area of the extent #x and the 
empty area j following this recording area. This creates a 

25 continuous empty area of length i+j before the extent #x, as 
shown in Fig. 69D. By next performing this processing for 
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the extent #y, the continuous length of the empty area can 
be further increased. 

The recording performed by the def ragmentation unit 16 
is achieved by first storing an extent on the fixed magnetic 
5 disk apparatus, so that even if a power failure occurs for 
the DVD recorder 7 0 during the writing of the extent back 
onto the DVD-RAM, this writing processing can still be re- 
executed. By generating backup data before moving the 
extents to free large empty areas on the DVD-RAM, there is 

10 no risk of the losing the data in an extent when there is a 
power failure for the DVD recorder 70. 

With the present embodiment described above, the 
editing of a plurality of AV files can be freely performed 
by the user. Even if a plurality of fragmental AV files 

15 with short continuous lengths are generated, the DVD 

recorder 70 will be able to links these short AV files to 
generated AV files with continuous lengths that are at least 
equal to AV block length. As a result, problems caused by 
the fragmentation of AV files can be managed, and 

20 uninterrupted reproduction can be performed for the AV data 
that is recorded in these AV files. 

During the link processing, it is judged whether the 
total size of the data to be written is as least equal to 
two AV block lengths, and if so, the moved amount of 

25 prerecorded AV data is restricted. As a result, it can be 
guaranteed that total size of the data to be written is 
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below two AV block lengths, so that the linking can be 
completed in a short amount of time. 

Even when it is necessary as a result of a user editing 
operation for a plurality of files to record re-encoded data 
5 with a short continuous length, the DVD recorder 70 will 
record this re-encoded data at a recording position that 
allows the re-encoded data to be linked with the AV data 
that precedes or follows it during reproduction. This means 
that the fragmented recording of re-encoded data is 

10 prevented from the outset, so that uninterrupted 

reproduction will be possible for the AV data that is 
recorded in such an AV file. 

It should be noted here that the movement of data may 
also be performed so as to avoid excessive separation on the 

15 disc of two sets of AV data that have been linked together. 
In such a case, the data produced by linking the sets of 
data that are physically separated on the disc is arranged 
in a manner that ensures uninterrupted reproduction of the 
two sets of AV data will be possible. However when special 

20 reproduction such as fast forward is performed, excessive 
separation of the data on the disc will result in jerky 
reproduction of the data. 

To ensure smooth reproduction in such a case, when two 
sets of AV data are linked, if one of the sets of data has a 

25 consecutive length that is several times a predetermined 
amount and an empty block of suitable size is positioned 
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between the two sets of data, the data may be moved to this 
empty block. By doing so, smooth reproduction can be 
ensured for both normal reproduction and special 
reproduction. 

5 It should be noted here that the time information 

may be taken from the mark points in the cell 
information and managed with information such as address 
taken from the time map table in the form of a table. 
By doing so, this information can be presented to the 

10 user as potential selections in a screen showing the 
initial pre-editing state. 

Reduced images (known as "thumbnails") may also be 
generated for each mark point and stored as separate 
files, with pointer information also being produced for 

15 each thumbnail. When displaying the cell information at 
the pre-editing stage, these thumbnails may be displayed 
to show the potential selections that can be made by the 
user . 

Also, while the present embodiment describes a case 
20 when video data and audio data are handled, this is not an 
effective limitation for the techniques of the present 
invention. For a DVD-ROM, sub-picture data for subtitles 
that has been run-length encoded and still images may also 
be handled. 

25 The processing of AV file system unit 11 (Figs. 
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48A, 48B, 49-50, 55, 60, 65, 67) that was described in 
this third embodiment using flowcharts can be achieved 
by a machine language program. Such machine language 
program may be distributed and sold having been recorded 
on a recording medium. Examples of such recording 
medium are an IC card, an optical disc, or a floppy 
disc. The machine language program recorded on the 
recording medium may then be installed into a standard 
personal computer. By executing the installed machine 
language programs, the standard personal computer can 
achieve the functions of the video data editing 
apparatus of this third embodiment. 

Fourth Embodiment 

The fourth embodiment of the present invention 
performs a two-stage editing process composed of virtual 
edits and real edits using two types of program chain, 
namely user-defined PGCs and original PGCs. To define 
the user-defined PGCs and the original PGCs, a new table 
is added to the RTRW management file of the first 
embodiment . 

(4-1) RTRW Management File 

The following is a description of the construction 
of the RTRW management file in this fourth embodiment. 
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In the fourth embodiment, the RTRW management file is 
recorded in the same directory as AV files (the RTRW 
directory) , and has the content shown in Fig. 70A. 

Fig, 70A shows a detailed expansion of the stored 
5 content of the RTRW management file in the fourth 

embodiment. This is to say, the logical format located 
on the right side of Fig. 70A shows the logical format 
located on the left side in more detail, with the broken 
guidelines in Fig. 70A showing the correspondence 

10 between the left and right sides. 

From the logical format of VOBs shown in Fig. 7 OA, 
the RTRW management file can be seen to include an 
original PGC information table, a user-defined PGC 
information table, and a title search pointer, in 

15 addition to the VOB information of the first embodiment. 

(4-1-2) Content of the Original PGC Information 

The original PGC information table is composed of a 
plurality of sets of original PGC information. Each set 

20 of original PGC information is information that 

indicates either the VOBs that are stored in a AV file 
present in the RTRW directory or sections within these 
VOBs, in accordance with the order in which these are 
arranged in the AV file. Each set of original PGC 

25 information corresponds to one of the VOBs recorded in 
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an AV file present in the RTRW directory, so that when 
an AV file is recorded in the RTRW directory, sets of 
original PGC information are generated by the video data 
editing apparatus and recorded in the RTRW management 
5 file. 

Fig. 70B shows the data format of a set of original 
PGC information. Each set of original PGC information 
is composed of a plurality of sets of cell information, 
with each set of cell information being composed of a 

10 cell ID (CELL #1, #2, #3, #4... in Fig. 70B) that is a 
unique identifier assigned to the set of cell 
information, an AV file ID (AVF_ID in Fig. 70B) , a 
VOB_ID, a C_V_S_PTM, and a C_V_E__PTM . 

The AV file ID is a column for writing the 

15 identifier of the AV file that corresponds to the set of 
cell information. 

The VOB_ID is a column for writing the identifier 
of a VOB that is included in the AV file. When a 
plurality of VOBs are included in the AV file that 

20 corresponds to the set of cell information, this VOB_ID 
indicates which of the plurality of VOBs corresponds to 
the present set of cell information. 

The cell start time C_V_S__PTM (abbreviated to 
C_V_S_PTM in the drawings) shows the start time of the 

25 cell indicated by the present cell information, and so 
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has a column for writing the PTS that is assigned to the 
start time of the first video field in the section using 
PTM descriptor format. 

The cell end time C_V_E_PTM (abbreviated to 
5 C_V_E_PTM in the drawings) shows the end time of the 

cell indicated by the present cell information, and so 
has a column for writing the end time of the final video 
field in the section using PTM descriptor format. 

The time information given as the cell start time 

10 C_V_S_PTM and cell end time C_V_E_PTM shows the start 

time for an encoding operation by the video encoder and 
the end time for the encoding operation, with these 
corresponding to the mark points inserted by the user. 
The cell end time C_V__E_PTM in each set of cell 

15 information in a set of original PGC information matches 
the cell start time C_V_J3_PTM of the next set of cell 
information in the given order. Since this relationship 
is established between the sets of cell information, an 
original PGC indicates all of the sections in a VOB 

20 without omitting any of the sections. As a result, an 
original PGC is unable to indicate sections of a VOB in 
an order where the sections are interchanged. 



25 



(4-1-3) Content of the User-defined PGC information 

The user-defined PGC information table is composed 
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of a plurality of sets of user-defined PGC information. 
The data format of sets of user-defined PGC information 
is shown in Fig. 70C. Like the sets of original PGC 
information, the sets of user-defined PGC information 
5 are composed of a plurality of sets of cell information, 
each of which is composed of an AV file ID, a VOB ID, a 
C_V_S_PTM, and a C_V_E_PTM . 

A set of user-defined PGC information is composed 
of a plurality of sets of cell information in the same 

10 way as a set of original PGC information, although the 

nature and arrangement of these sets of cell information 
differ to those in a set of original PGC information. 
While a set of original PGC information indicates that 
the sections in a video object are to be sequentially 

15 reproduced in the order in which the sets of cell 

information are arranged, a set of user-defined PGC 
information is not restricted to indicating that the 
sections in a video object are to be reproduced in the 
order in which they are arranged. 

20 The sections indicated by the sets of cell 

information in a user-defined PGC can be the same as the 
sections indicated by the sets of user-defined PGC 
information or a part (partial section) of one of the 
sections indicated by a set of original PGC information. 

25 Note that it is possible for the section indicated by 
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one set of cell information to overlap a section 
indicated by another set of cell information. 

There also may be gaps between a section that is 
indicated by one set of cell information and a section 
5 that is indicated by another set of cell information. 

This means that sets of user-defined PGC information do 
not need to indicate every section in a VOB, so that one 
or more parts of a VOB may not be indicated. 

While original PGCs have strict limitations 

10 concerning their reproduction orders, user-defined PGCs 
are not subject to such limitations, so that the 
reproduction order of cells may be freely defined. As a 
specific example, the reproduction order of the cells in 
a user-defined PGC may be the inverse of the order in 

15 which the cells are arranged. Also, a user-defined PGC 
may indicate sections of VOBs that are recorded in 
different AV files. 

Original PGCs indicate the partial sections in one 
AV file or one VOB in accordance with the order in which 

20 the AV file or VOBs are arranged, so that original PGCs 
may be said to respect the arrangement of the indicated 
data. User-defined PGCs, however, have no such 
restriction, and so are able to indicate the sections in 
the user's desired order. As a result, these user- 

25 defined PGCs are ideal for storing reproduction orders 
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that are provisionally determined by the user for the 
linking of a plurality of sections in VOBs during a 
video data editing operation. 

Original PGCs are associated to AV files and the 
5 VOBs in AV files, and the cells in an original PGC only 
indicate sections in these VOBs. User-defined PGCs, 
meanwhile, are not limited to being associated to 
particular VOBs, so that the sets of cell information 
included in user-defined PGC information may indicate 
10 sections in different VOBs. 

As another difference, an original PGC is generated 
when recording an AV file, while a user-defined PGC may 
be generated at any point following the recording of an 
AV file. 

15 

(4-1-4) Unity of the PGC information - Video Attribute 
Information - AV File 

The following is an explanation of the inter- 
relatedness of the AV files, VOBs, and sets of PGC 

20 information. Fig. 71 shows the inter-relatedness of the 
AV files, VOBs, time map table, and sets of PGC 
information, with the elements that form a unified body 
being enclosed within the frames drawn using thick black 
lines. Note that in Fig. 71, the term "PGC information" 

25 has been abbreviated to "PGCI". 
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In Fig. 71, the AV file #1, the VOB information #1, 
and the original PGC information #1 composed of the sets 
of cell information #1 to #3 have been arranged within 
the same frame, while the AV file #2, the VOB 
5 information #2, and the original PGC information #2 

composed of the sets of cell information #1 to #3 have 
been arranged within a different frame. 

These combinations of an AV file (or VOB) , VOB 
information, and original PGC information that are 
10 present in the same frame in Fig. 71 are called an 

"original PGC" under DVD-RAM standard. A video data 
editing apparatus that complies to DVD-RAM standard 
treats the units called original PGCs as a management 
unit called a video title. 
15 For the example in Fig. 71, the combination of the 

AV file #1, the VOB information #1, and original PGC 
information #1 is called the original PGC #1, while the 
combination of the AV file #2, the VOB information #2, 
and original PGC information #2 is called the original 
20 PGC #2. 

When recording an original PGC, in addition to 
recording the encoded VOBs on the DVD-RAM, it is 
necessary to generate VOB information and original PGC 
information for these VOBs. The recording of an 
25 original PGC is therefore regarded as complete when all 
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three of the AV file, VOB information table, and 
original PGC information have been recorded onto the 
DVD-RAM. Putting this another way, the recording of 
encoded VOBs on a DVD-RAM as an AV file itself is not 
5 regarded as completing the recording of an original PGC 
on the DVD-RAM. 

This is also the case for deletion, so that 
original PGCs are deleted as a whole. Putting this 
another way, when any of an AV file, VOB information and 

10 original PGC information is deleted, the other elements 
in the same original PGC are also deleted. 

The reproduction of an original PGC is performed by 
the user indicating the original PGC information. This 
means that the user does not give direct indications for 

15 the reproduction of a certain AV file or VOBs. 

It should be noted here that an original PGC may 
also be reproduced in part. Such partial reproduction 
of an original PGC is performed by the user indicating 
sets of cell information that are included in the 

20 original PGC, although reproduction of a section that is 
smaller than a cell, such as a VOBU, cannot be 
indicated. 

The following describes the reproduction a user- 
defined PGC. In Fig. 71, it can be seen that the user- 
25 defined PGC information #3, composed of the cells #1 to 
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#4, is included in a separate frame to the original PGCs 
#1 and #2 described earlier. This shows that for DVD- 
RAM standard, the user-defined PGC information is not in 
fact AV data, and is instead managed as a separate 
5 title. 

As a result, a video data editing apparatus defines 
the user-defined PGC information in the RTRW management 
file, and by doing so is able to complete the generation 
of a user-defined PGC. For user-defined PGCs, there is 

10 a relationship whereby the production of a user-defined 
PGC equates to the definition of a set of user-defined 
PGC information. 

When deleting a user-defined PGC, it is sufficient 
to delete the user-defined PGC information from the RTRW 

15 management file, with the user-defined PGC being 
regarded as not existing thereafter. 

The units for reproduction of a user-defined PGC 
are the same as for an original PGC. This means that 
the reproduction of a user-defined PGC is performed by 

20 the user indicating the user-defined PGC information. 
It is also possible for user-defined PGCs to be 
partially reproduced. Such partial reproduction of a 
user-defined PGC is achieved by the user indicating 
cells that are included in the user-defined PGC. 

25 The differences between original PGCs and user- 
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defined PGCs are as described above, but, from the 
viewpoint of the user, there is no need to be aware of 
such differences. This is because the entire 
reproduction or partial reproduction of both types of 
PGCs is performed in the same way by respectively 
indicating the PGC information or cell information. As 
a result, both kinds of PGCs are managed in the same way 
using a unit called a "video title". 

The following is an explanation of the reproduction 
of original PGCs and user-defined PGCs. The arrows 
drawn with broken lines in Fig. 71 show how certain sets 
of data refer to other data. Arrows y2, y4 , y6, and y8 
show the relationship between each VOBU in a VOB and the 
time codes included in the time map table in the VOB 
information, while yl, y3, y5, and y7 show the 
relationship between the time codes included in the time 
map table in the VOB information and the sets of cell 
information. 

Here, it is assumed that the user has indicated one 
of the PGCs, so that a video title is to be reproduced. 
When the indicated PGC is the original PGC #1, the set 
of cell information #1 located at the front of the 
original PGC information #1 is extracted by the 
reproduction apparatus. Next, the reproduction 
apparatus refers to the AV file and VOB identifiers 
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included in the extracted set of cell information #1, 
and specifies the AV file #1, the V0B#1, and the time 
map table #1 for this VOB as the AV file and VOB 
corresponding to this cell information. 
5 The specified time map table #1 includes the size 

of each VOBU that composes the VOB and the reproduction 
period of each VOBU. To improve the data accessing 
ability, the specified time map table #1 also includes 
the address and elapsed time relative to the start of 

10 the VOB for representative VOBUs that are selected at a 
constant interval, such as a multiple of ten seconds. 
As a result, by referring to the time map table using 
the cell start time C_V_S_PTM, as shown by the arrow yl, 
the reproduction apparatus can specify the VOBU in the 

15 AV file that corresponds to the cell start time 

C_V_S_PTM included in the set of cell information #1, 
and so can specify the first address of this VOBU. By 
doing so, the reproduction apparatus can determine the 
first address of the VOBU that corresponds to this cell 

20 start time C_V_S_PTM, can access VOBU #1 as shown by the 
arrow y2, and so can start reading the VOBU sequence 
that starts from VOBU#l. 

Since the set of cell information #1 also includes 
the cell end time C__V_E_PTM, the reproduction apparatus 

25 can access the time map table using this cell end time 
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C_V_E_PTM, as shown by the arrow y3, to specify the VOBU 
in the AV file that corresponds to the cell end time 
C_V_E__PTM included in the set of cell information #1. 
As a result, the reproduction apparatus can determine 
5 the first address of the VOBU that corresponds to the 

cell end time C_V_E_PTM. When the VOBU that corresponds 
to the cell end time C_V_E_PTM is VOBU #10, for example, 
the reproduction apparatus will stop reading the VOBU 
sequence on reaching VOBU#10, as shown by arrow y4 . 

10 By accessing the AV file via the cell information 

#1 and the VOB information #1, the reproduction 
apparatus can read only the section indicated by the 
cell information #1, out of the data in VOB #1 that is 
included in AV file #1. If reads are also performed for 

15 the cell information #2, #3, and #4, all VOBUs that are 
included in VOB#l can be read and reproduced. 

When reproduction is performed for an original PGC 
as described above, the sections in the VOB can be 
reproduced in the order in which they are arranged in 

20 the VOB. 

The following explanation is for when the user 
indicates the reproduction of a video title indicated by 
one of the user-defined PGCs . 

When the indicated PGC is the user-defined PGC #1, 

25 the reproduction apparatus extracts the set of cell 
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information #1 that is positioned at the front of the 
user-defined PGC information #1 for this user-defined 
PGC #1. Next, the reproduction apparatus refers to the 
time map table #1 using the cell start time C__V_S_PTM 
5 included in this cell information #1, as shown by the 
arrow y5, and specifies the VOBU in VOBU #1 that 
corresponds to this cell start time C_VJS_PTM included 
in the cell information #1. In this case, the 
reproduction apparatus specifies VOBU #11 as the VOBU 

10 that corresponds to the cell start time C_V__S_PTM, 

accesses VOBU #11 as shown by the arrow y6, and starts 
reading a VOBU sequence that starts from VOBU #11. 

The cell information #1 included in the user- 
defined PGC #1 also includes the cell end time 

15 C_V_E_PTM, so that the reproduction apparatus refers to 
the time map table using this cell end time C_V_E_PTM, 
as shown by the arrow y7 , and specifies the VOBU in VOB 
#1 that corresponds to the cell end time C_V__E_PTM that 
is included in the cell information #1. When the VOBU 

20 that corresponds to the cell end time C_V_E_PTM is VOBU 
#21, for example, the reproduction apparatus will stop 
reading the VOBU sequence on reaching VOBU #21, as shown 
by arrow y8 . 

As described above, after accessing the AV file via 
25 the cell information #1 and VOB information #1, the 
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reproduction apparatus performs the same processing for 
the cell information #2, #3, and #4 included in the 
user-defined PGC information #1. 

After extracting the cell information #2 which is 
5 located at a position following the cell information #1, 
the reproduction apparatus refers to the AV file 
identifier included in the extracted cell information #2 
and so determines that AV file #2 corresponds to this 
cell information and that time map table #2 corresponds 

10 to this AV file. 

The specified time map table #2 includes the size 
of each VOBU that composes the VOB and the reproduction 
period of each VOBU. To improve the data accessing 
ability, the specified time map table #2 also includes 

15 the address and elapsed time relative to the start of 

the VOB for representative VOBUs that are selected at a 
constant interval, such as a multiple of ten seconds. 
As a result, by referring to the time map table using 
the cell start time C_V_S_PTM, as shown by the arrow y9, 

20 the reproduction apparatus can specify the VOBU in the 
AV file that corresponds to the cell start time 
C_V_S_PTM included in the set of cell information #2, 
and so can specify the first address of this VOBU. By 
doing so, the reproduction apparatus can determine the 

25 first address of the VOBU that corresponds to this cell 
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start time C_V_S_PTM, can access VOBU #2 as shown by the 
arrow ylO, and so can start reading the VOBU sequence 
that starts from VOBU#2. 

Since the set of cell information #2 also includes 
5 the cell end time C_V_E_PTM, the reproduction apparatus 
can access the time map table using this cell end time 
C_V_E_PTM, as shown by the arrow yll, to specify the 
VOBU in the AV file that corresponds to the cell end 
time C_V_E_PTM included in the set of cell information 

10 #2. As a result, the reproduction apparatus can 

determine the first address of the VOBU that corresponds 
to the cell end time C_V_E_PTM. When the VOBU that 
corresponds to the cell end time C_V_E_PTM is VOBU #11, 
for example, the reproduction apparatus will stop 

15 reading the VOBU sequence on reaching VOBU#ll, as shown 
by arrow yl2. 

By reproducing the user-defined PGC information in 
this way, the desired sections in VOBs included in two 
AV files may be reproduced in the given order. 

20 This completes the explanation of the unity of AV 

file, VOB information, and PGC information. The 
following is a description of the title search pointer 
shown in Fig. 70. 

25 (4-1-5) Content of the Ti tle Search Pointer 
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The title search pointer is information for 
managing the VOB information, time map table, PGC 
information, and AV files recorded on a DVD-RAM in the 
units called video titles that were described above. 
5 Each title search pointer is composed of the PGC number 
that is assigned to a set of original PGC information or 
a set of user-defined PGC information, a title type, and 
a title recording history. 

Each title type corresponds to one of the PGC 

10 numbers, and is set at the value "00" to show that the 
AV title with the corresponding PGC number is an 
original type PGC, or is set at the value "01" to show 
that the AV title with the corresponding PGC number is a 
user-defined PGC. 

15 The title recording history shows the data and time 

at which the corresponding PGC information was recorded 
onto the DVD-RAM. 

When the RTRW directory on a DVD-RAM is indicated, 
a reproduction apparatus that complies to DVD-RAM 

20 standard reads the title search pointers from the RTRW 
management file and so can instantly know how many 
original PGCs and user-defined PGCs are given in each 
directory on the DVD-RAM and when each of these video 
titles were recorded in the RTRW management file. 
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(4-1-6) Interchanaeability of User-defined PGCs 

and Original PGCs in a Real Edit 
The user-defined PGC information defined in a 
virtual edit can be used to indicate the linking order 
5 for cells in a real edit, as shown in this fourth 
embodiment . 

Also, once a real edit has been performed as 
described in the fourth embodiment, if a set of user- 
defined PGC information is converted into a set of 
10 original PGC information, original PGC information can 
be easily generated for the VOBs obtained by this 
linking . 

This is because the data construction of the user- 
defined PGC information and the original type 
15 information only differ in the value given as the title 
type, and because the sections of a VOB obtained by a 
real edit are the sections that were indicated by the 
user-defined PGC information before the real edit . 

The following is an explanation of the procedure 
20 for a real edit in this fourth embodiment, and of the 
process for updating user-defined PGC information to 
original PGC information. Fig. 72 shows an example of a 
user-defined PGC and an original PGC. 

In Fig. 72, original PGC information #1 includes 
25 only cell#l, and forms part of an original PGC with 
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V0B#1 and the VOB information. On the other hand, user- 
defined PGC information #2 forms a user-defined PGC 
using only cell#l, cell#2, and cell#3. 

In Fig. 72, cell#l indicates the section from 
5 VOBU#l to VOBU#i, as shown by the broken arrows y51 and 
y52, while cell#2 indicates the section from VOBU#i+l to 
VOBU#j, as shown by the broken arrows y53 and y54, and 
cell#3 indicates the section from VOBU#j+l to V0BU#k+2, 
as shown by the broken arrows y55 and y56. 

10 In the following example, cell#2 is deleted from 

the user-defined PGC information, and the user indicates 
a real edit using the user-defined PGC information #2 
composed of the cells #1 and #3. In Fig. 73, the area 
that corresponds to the deleted cell is shown using 

15 shading. 

Cell#2, which is deleted here indicates one of the 
video frames, out of the plurality of sets of picture 
data included in VOBU #i+l shown within the frame wll, 
using the cell start time C_V_S_PTM. Cell#2 also 

20 indicates one of the video frames, out of the plurality 
of sets of picture data included in VOBU #j+l shown 
within the frame wl2, using the cell end time C_V__E_PTM. 

If a real edit is performed using the user-defined 
PGC information #2, VOBUs #i-l, i, and i+1 located at 

25 the end of cell#l and VOBUs #j, and j+2 located at 
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the start of cell#2 will be subjected to re-encoding. 
This re-encoding is performed according to the procedure 
described in the first and second embodiments, and the 
linking of the extents is then performed according to 
5 the procedure described in the third embodiment. 

Fig. 7 4A shows the ECC blocks on the DVD-RAM that 
are freed by a real edit performed using user-defined 
PGC information #2. As shown on the second level of 
Fig. 74A, VOBUs #i, #i+l, and #i+2 are recorded in the 

10 AV block #m, and VOBUs #j, and #j+2 are recorded 

in the AV block #n. 

As shown in Fig. 73, cell #2 indicates the picture 
data included in VOBU #i+l as the C_V_S_PTM, and the 
picture data included in VOBU as the C_V_E_PTM. As 

15 a result, a SPLIT command and a SHORTEN command of the 
second embodiment are issued to free the area from the 
ECC block occupied by VOBU #i+2 to the ECC block 
occupied by VOBU #j, as shown by the frames wl3 and wl4 
in Fig. 74A. However, the ECC blocks occupied by VOBUs 

20 #i and #i+l and the ECC blocks occupied by VOBUs #j+l 
and j+2 are not freed. 

Fig. 7 4B shows an example of a VOB, VOB information 
and PGC information after a real edit. Since the area 
corresponding to cell #2 have been deleted, VOB #1 is 

25 deleted into (new) VOB#l and VOB#2. 
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When the SPLIT command is issued, the VOB 
information for V0B#1 is divided into VOB information #1 
and VOB information #2. The time map tables included in 
this VOB information is also divided into the time map 
table #1 and the time map table #2. Although not 
illustrated, the seamless linking information is also 
divided. 

The VOBUs in V0B#1 and VOB#2 are referred to by a 
reproduction apparatus via these divided time map 
tables . 

The user-defined PGC information and original PGC 
information have the same data construction, with only 
the value of the title types differing. The sections of 
VOBs obtained after a real edit were originally 
indicated by the user-defined PGC information #2 before 
the real edit, so that the user-defined PGC information 
#2 is converted into original PGC information. Since 
this user-defined PGC information #2 is used to define 
the original information, there is no need for a 
separate process to generate new original PGC data after 
a real edit. 

(4-2) Functional Blocks of the DVD Recorder 70 

Fig. 75 is a functional block diagram showing the 
construction of the DVD recorder 70 in this fourth 
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embodiment. Each function shown in Fig. 75 is realized 
by the CPU la executing the programs in the ROM le and 
controlling the hardware shown in Fig. 17. 

The DVD player shown in Fig. 75 is composed of a 
5 disc recording unit 100, a disc reading unit 101, a 

common file system unit 10, an AV file system unit 11, 
and an recording-editing-reproduction control unit 12, 
in the same way as in video data editing apparatus 
described in the third embodiment. The present 

10 embodiment differs with the third embodiment, however, 
in that the AV data recording unit 13 is replaced with 
the title recording control unit 22, the AV data 
reproduction unit 14 is replaced with the title 
reproduction control unit 23, and the AV data editing 

15 unit 15 is replaced with the editing multi-stage control 
unit 26. This DVD player also includes a PGC 
information table work area 21, an RTRW management file 
work area 24, and a user-defined PGC information 
generator 25, in place of the def ragmentation unit 16. 

20 

(4-2-1) Recording-Editina-Reoroduction Cont rol Unit 12 
The recording-editing-reproduction control unit 12 
in this fourth embodiment receives a user indication of 
a directory in the directory structure on the DVD-RAM as 
25 the operation target. On receiving the user indication 
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of the operation target, the recording-editing- 
reproduction control unit 12 specifies the operation 
content according to the user operation that has been 
reported by the remote control signal reception unit 8. 
5 At the same time, the recording-editing-reproduction 
control unit 12 gives instructions so that processing 
corresponding to the operation content is performed for 
the directory that is the operation target by the title 
recording control unit 22, the title reproduction 

10 control unit 23, or any of the other components. 

Fig. 77A shows an example of graphics data that is 
displayed on the TV monitor 72 under the control of the 
recording-editing-reproduction control unit 12. When 
any of the directories has been set into the focus 

15 state, the recording-editing-reproduction control unit 
12 waits for the user to press the enter key. When the 
user does so, the recording-editing-reproduction control 
unit 12 specifies the directory that is presently in the 
focus state as the current directory. 

20 

(4-2-2) PGC Informa tion Table Work Area 21 

The PGC information table work area 21 is a memory 
area that has a standardized logical format so that sets 
of PGC information can be successively defined. This 
25 PGC information table work area 21 has internal regions 

234 



WO 99/14935 



PCT/JP98/04174 



that are managed as a matrix. The plurality of sets of 
PGC information that are present in the PGC information 
table work area 21 are arranged in different columns 
while a plurality of sets of cell information are 
5 arranged in different rows. In the PGC information 
table work area 21, any set of cell information in a 
stored set of PGC information can be accessed using a 
combination of a row number and a column number. 

Fig. 7 6 shows examples of sets of original PGC 

10 information that are stored in PGC information table 
work area 21. It should be noted here that when the 
recording of an AV file is completed, the user-defined 
PGC information table will be empty (shown as "NULL" in 
Fig. 76"). In Fig. 76, the original PGC information #1 

15 includes the set of cell information #1 showing the 

section between the start time tO and the end time tl, 
the set of cell information #2 showing the section 
between the start time tl and the end time t2, the set 
of cell information #3 showing the section between the 

20 start time t2 and the end time t3, and the set of cell 
information #4 showing the section between the start 
time t3 and the end time t4. 



(4-2-3) Title Recording Control Unit 22 
25 The title recording control unit 22 records VOBs 
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onto the DVD-RAM in the same way as the AV data 
recording unit 13 in the third embodiment, although in 
doing so the title recording control unit 22 also stores 
a time map table in the RTRW management file work area 
5 24, generates VOB information, and generates original 
PGC information which it stores in the PGC information 
table work area 21. 

When generating original PGC information, the title 
recording control unit 22 follows the procedure 

10 described below. First, on receiving notification of a 
pressing of the record key from the recording-editing- 
reproduction control unit 12, the title recording 
control unit 22 secures a row area in the PGC 
information table work area 21. Next, after the AV data 

15 recording unit 13 has assigned an AV file identifier and 
a VOB identifier to the VOB to be newly recorded, the 
title recording control unit 22 obtains these 
identifiers and stores them in the secured row area 
corresponding to a newly assigned PGC number. 

20 Next, when encoding is started for the VOB, the 

title recording control unit 22 instructs the MPEG 
encoder 2 to output the PTS of the first video frame. 
When the encoder control unit 2g has outputted this PTS 
for the first video frame, the title recording control 

25 unit 22 stores this value and waits for the user to 
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perform a marking operation. 

Fig. 8 OA shows how data input and output are 
performed between the components shown in Fig. 7 5 when a 
marking operation is performed. While viewing the video 
5 images displayed on the TV monitor 72, the uses presses 
the mark key on the remote controller 71. This marking 
operation is reported to the title recording control 
unit 22 via the route shown as ©, ©, (D in Fig. 80A. 
The title recording control unit 22 then obtains the PTS 

10 for the point where the user pressed the mark key from 
the encoder control unit 2g, as shown by ® in Fig. 80A, 
and sets this as time information. 

The title recording control unit 22 repeatedly 
performs the above processing while a VOB is being 

15 encoded. If the user presses the stop key during the 

generation of the VOB, the title recording control unit 
22 instructs the encoder control unit 2g to output the 
presentation end time for the last video frame to be 
encoded. Once the encoder control unit 2g has outputted 

20 this presentation end time for the last video frame to 
be encoded, the title recording control unit 22 stores 
this as time information. 

By repeating the above processing until the 
encoding of a VOB is complete, the title recording 

25 control unit 22 ends up storing the AV file identifier, 
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the VOB identifier, the presentation start time of the 
first video frame, the presentation start time of each 
video frame corresponding to a point where a marking 
operation was performed, and the presentation end time 
5 of the final video frame. 

Of this stored time information, the title 
recording control unit 22 sets the start time and end 
time of a section and the corresponding AV file 
identifier and VOB identifier as one set of cell 

10 information which it stores in a newly-secured row in 
the PGC information table work area 21. By doing so, 
the title recording control unit 22 newly generates 
original PGC information. 

On completing the above generation, the title 

15 recording control unit 22 associates this original PGC 
information to the assigned PGC number, and, in the PGC 
information table work area 21, generates a title search 
pointer that has type information showing that this PGC 
information is original PGC information, and a title 

20 recording history showing the date and time at which the 
recording of this PGC information was completed. 

It should be noted here that if the title 
reproduction control unit 23 can detect when there is a 
large change in the content of scenes, the user-defined 

25 PGC information generator 25 may automatically obtain 
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the PTS for the points at which such scene changes occur 
and automatically set these PTS in sets of cell 
information. 

The generation of a time map table or VOB 
5 information does not form part of the gist of this 
embodiment, and so will not be explained. 

(4-2-4) Title Reproduction Control Unit 23 

The title reproduction control unit 23 performs 
10 reproduction or partial reproduction for any of the 
titles recorded in the current directory that is 
indicated by the recording-editing-reproduction control 
unit 12. 

This is described in more detail below. When, as 
15 shown in Fig. 77A, one of the directories is selected as 
the current directory and the user gives an indication 
for the reproduction of one of the title stored in this 
directory, the title reproduction control unit 23 
displays the screen image shown in Fig. 77A, reads the 
20 original PGC information table and user-defined PGC 

information table in the RTRW management file in this 
directory, and has the user select the complete 
reproduction or partial reproduction of one of the 
original PGCs or user-defined PGCs in the current 
25 directory. Fig. 77B shows the PGCs and cells that are 
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displayed as the list of potential operation targets. 
The sets of PGC information and cell information that 
represent these PGCs and cells are the same as those 
shown in the example of Fig. 76. 
5 The original PGCs that appear in this interactive 

screen are shown in a simple graph that shows time in 
the horizontal axis, with the each original PGC being 
displayed along with the date and time at which it was 
recorded. In Fig. 77B, the menu at the bottom right of 

10 the screen shows whether complete reproduction or 

partial reproduction is to be performed for the video 
title in the current directory. By pressing the "1" or 
"2" key on the remote controller 71, the user can select 
complete reproduction or partial reproduction or the 

15 video title. If the user selects complete reproduction, 
the title reproduction control unit 23 has the user 
select one of the PGCs as the operation target, while if 
the user selects partial reproduction, the title 
reproduction control unit 23 has the user select one of 

20 the cells as the operation target. 

When complete reproduction has been selected for a 
PGC, the title reproduction control unit 23 extracts the 
cells from the PGC selected as the operation target and, 
by referring a time map table such as that shown in Fig. 

25 71, reproduces the sections indicated by the cells one 
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by one. On completing the reproduction of the sections, 
the title reproduction control unit 23 has the 
interactive screen shown in Fig. 77B displayed and waits 
for the next selection of cell information. 
5 Fig. 78A is a flowchart showing the processing when 

partially reproducing sets of cell information. First, 
in step S271, the title reproduction control unit 23 
reads the C__V_S_PTM and C_V_E_PTM from the cell 
information to reproduced out of the original PGC 

10 information or user-defined PGC information. Next, in 
step S272, the title reproduction control unit 23 
specifies the address of the VOBU (START) that includes 
the picture data assigned C__V_J3_PTM. 

In step S273, the title reproduction control unit 

15 23 specifies the address of the VOBU (END) that includes 
the picture data assigned C_V_E_PTM, and in step S27 4, 
the title reproduction control unit 23 reads the section 
from VOBU (START) to VOBU (END) from the present VOB. 
In step S275, the title reproduction control unit 23 

20 instructs the MPEG decoder 4 to decode the read VOBUs . 
In step S276, the title reproduction control unit 23 
outputs the cell presentation start time (C_V_S_PTM) and 
cell presentation end time (C_V_E_PTM) to the decoder 
control unit 4k of the MPEG decoder 4 as valid 

25 reproduction section information, together with a decode 
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processing request . 

The reason the title reproduction control unit 23 
outputs the valid reproduction section information to 
the MPEG decoder 4 is that the decoder control unit 4k 
5 in the MPEG decoder 4 will try to decode even picture 
data that is not within the section indicated by the 
cell. In more detail, the unit for the decode 
processing of the MPEG decoder 4 is a VOBU, so that the 
MPEG decoder 4 will decode the entire section from 
10 VOBU (START) to VOBU (END), and in doing so will have 

picture data outside the section indicated by the cell 
reproduced. A cell indicates a section in units of 
video fields, so that a method for prohibiting the 
decoding and reproduction of picture data outside the 
15 section is necessary. To prohibit the reproduction of 
such picture data, the title reproduction control unit 
23 outputs valid reproduction section information to the 
title reproduction control unit 23. Fig. 78B shows how 
only the section between the cell presentation start 
20 time (C_V_S__PTM) and the cell presentation end time 

(C_V_E_PTM) , out of the area between the VOBU (START) 
and the VOBU (END) , is reproduced. 

By receiving this valid reproduction section 
information, the MPEG decoder 4 can stop the display 
25 output of an appropriate number of video fields from the 
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start of the VOBU (START) to C_V_S__PTM and the display 
output of an appropriate number of video fields from 
C_V_E_PTM to the VOBU (END) . For the hardware 
construction shown in Fig. 17, the disc access unit 3 
5 reads the VOBU sequence and outputs this to the MPEG 
decoder 4 via the logical connection (1) . The MPEG 
decoder 4 decodes this VOBU sequence and prohibits the 
reproduction output of the part that precedes C_V_S_PTM 
and the part that follows C__V_E_PTM. As a result, only 
10 the section indicated by the cell information is 
reproduced . 

Since one set of original PGC information or user- 
defined PGC information includes a plurality of sets of 
cell information, the procedure shown in Fig. 7 8A may be 
15 repeated for each set of cell information included in 
one set of PGC information. 

(4-2-5) RTRW Management File Work Area 24 

The RTRW management file work area 2 4 is a work 

20 area for arranging the original PGC information table 
composed of the plurality of sets of original PGC 
information generated in the PGC information table work 
area 21, the user-defined PGC information table composed 
of a plurality of sets of user-defined PGC information, 

25 the title search pointers, and the sets of VOB 
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information, in accordance with the logical format shown 
in Fig. 70. The common file system unit 10 writes the 
data arranged in the RTRW management file work area 2 4 
into the RTRW directory as non-AV files, and in doing so 
5 stores a RTRW management file in the RTRW directory. 

(4-2-6) User-Defined PGC In formation Generator 25 
The user-defined PGC information generator 25 
generates user-defined PGC information based on one set 

10 of PGC information recorded in the RTRW management file 
of the current directory. Two types of cell information 
can be present in the user-defined PGC information 
(called sets of user-defined cell information) , with 
these being a first type that indicates an area inside a 

15 section indicated by cell information in an existing set 
of PGC information, and a second type that indicates the 
same section as a set of cell information in an existing 
set of PGC information. The user-defined PGC 
information generator 25 generates these two types of 

20 cell information using different methods. 

To generate the first type of user-defined cell 
information that indicates an area inside a section 
indicated by existing cell information, the user-defined 
PGC information generator 25 has the title reproduction 

25 control unit 23 perform partial reproduction for the 
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section indicated by the existing cell information. 
During the partial reproduction for this section, the 
user-defined PGC information generator 25 monitors when 
the user performs marking operations, and generates sets 
5 of cell information with the times of the marking 

operations as the start point and end point. In this 
way, the user-defined PGC information generator 25 
generates user-defined PGC information composed of this 
first type of cell information. 

10 Figs. 7 9A and 7 9B show how the user uses the TV 

monitor 72 and remote controller 71 when generating 
user-defined PGC information. Fig. 80B shows the data 
input and output between the components shown in Fig. 75 
when a marking operation is performed. As shown in Fig. 

15 7 9A, the user views the video images displayed on the TV 
monitor 72 and presses the mark key on the remote 
controller 71 at the beginning of a desired scene. 
After this, the desired scene ends, as shown in Fig. 
79B, and the video images change to a content in which 

20 the user has no interest. Accordingly, the user presses 
the mark key again. 

This marking operation is reported to the user- 
defined PGC information generator 25 via the route shown 
as ®, (D, ® in Fig. 80B. The user-defined PGC 

25 information generator 25 then obtains the PTS of the 
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points when the user pressed the mark key from the MPEG 
decoder 4, as shown by ® in Fig. 80B, and stores the PTS 
as time information. The user-defined PGC information 
generator 25 then generates a set of cell information by 
5 attaching the appropriate AV file identifier and VOB 
identifier to a pair of stored PTS that are the start 
point and end point of a section, and stores this cell 
information in a newly secured row area the PGC 
information table work area 21, as shown by <D in Fig. 
10 80B. 

When generating user-defined PGC information that 
indicates a section indicated by an existing set of cell 
information, the user-defined PGC information generator 
25 merely copies the existing cell information into a 
15 different row area in the PGC information table work 
area 21. 

In more detail, the user-defined PGC information 
generator 25 secures a row area for one row in the RTRW 
management file work area 24, and assigns a new user- 
20 defined PGC information identifier to this row area. 

Once the cell information that should be used in 
the present user-defined PGC information has been 
indicated, out of the sets of cell information in the 
PGC information already stored in the PGC information 
25 table work area 21, using a combination of a row number 
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and a column number, the user-defined PGC information 
generator 25 reads the cell information and copies it 
into a newly secured row area in the PGC information 
table work area 21. 

(4-2-7) Editing Multi-Stage Control Unit 26 

The editing multi-stage control unit 26 controls 
the title reproduction control unit 23 , the user-defined 
PGC information generator 25, and the seamless linking 
unit 20 to perform a multi-stage editing process 
including : 

1. virtual edits achieved by defining user-defined 
PGC information; 

2. previews which allow the user to view the video 
images that would be obtained by a real edit, based on 
the result of a virtual edit; 

3. seamless linking, as described in the first and 
second embodiments; and 

4 . real edits performed by linking AV files as 
described in the third embodiment. 

(4-2-7-1) Procedure for Multi-Staae Editing by the 

Editing Mult i-Stage Control Unit 26 
The following is a description of the specific 
procedure for the multi-stage control performed by the 
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editing multi-stage control unit 26. When the user 
selects a virtual edit using the remote controller 71 in 
response to the interactive screen shown in Fig. 77A, 
the editing multi-stage control unit 26 accesses the 
5 RTRW directory, has the common file system unit 10 read 
the RTRW management file from the RTRW directory, and 
has the RTRW management file stored in the RTRW 
management file work area 24. Next, out of the RTRW 
management file stored in the RTRW management file work 

10 area 24, the editing multi-stage control unit 26 

transfers the original PGC information table, the user- 
defined PGC information table, and the title search 
pointers to the PGC information table work area 21, and 
transfers the time map table to the time map table work 

15 area. 

Based on the transferred original PGC information 
table, the editing multi-stage control unit 26 displays 
the interactive screen shown in Fig. 85, and waits for 
the next user indication. 
20 Fig. 85 shows an example of the interactive screen 

displayed by the TV monitor 72 to have the user select 
the sections for the cells of a user-defined PGC in a 
virtual edit. 

This interactive screen displays the original PGCs 
25 and user-defined PGCs as simple graphs, where the 
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horizontal axis represents time. The recording date and 
time of each original PGC and user-defined PGC is also 
displayed. This interactive screen displays the 
plurality of cells as a horizontal arrangement of 
5 rectangles. The user may select any of these rectangles 
using the cursor keys on the remote controller 71. 
These original PGCs and cells are the same as those 
shown in Fig. 76, and the following describes the 
updating of the original PGC information table, the 

10 user-defined PGC information table and the title search 
pointers with Fig. 76 as the initial state. 

Fig. 81 is a flowchart showing the processing of 
the editing multi-stage control unit 2 6 when defining a 
user-defined PGC. In this flowchart, the variable j 

15 indicates one of the plurality of original PGCs that are 
arranged vertically in the interactive screen and the 
variable k indicates one of the plurality of cells that 
are arranged horizontally in the interactive screen. 
The variable m is the PGC number that should be 

20 assigned to the set of user-defined PGC information that 
is being newly defined in the RTRW management file, and 
the variable n is the cell number that should be 
assigned to the set of cell information that is being 
newly defined in the RTRW management file. 

25 In step S201, the editing multi-stage control unit 
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2 6 substitutes a value given by adding one to the last 
number of the original PGC information in the RTRW 
management file into the variable m and "1" into the 
variable n. In step S202, the editing multi-stage 
5 control unit 2 6 adds a space for the mth user-defined 
PGC information to the user-defined PGC information 
table and in step S203, the editing multi-stage control 
unit 2 6 waits for the user to make a key operation. 
Once the user has made a key operation, in step S204 the 

10 editing multi-stage control unit 26 sets the flag for 

the pressed key, out of the flags that correspond to the 
keys on the remote controller 71, at "1", and in step 
S205 judges whether the Enter_Flag, which shows whether 
the enter key has been pressed, is "1". In step S206, 

15 the editing multi-stage control unit 2 6 judges whether 
the End_Flag, which shows whether the end key has been 
pressed, is "1". When both these flags are "0", the 
editing multi-stage control unit 2 6 uses the Right_Flag s 
Left_Flag N Down_Flag N Upper_Flag, which respectively show 

20 whether the right, left, down, or up keys have been 

pressed, to perform the following calculations, before 
substituting the calculation results into the variables 
k and j . 

k-k+1* (Right_Flag) -1* (Left_Flag) 
25 j-j+1* (Down_Flag) -1* (Up_Flag) 
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When the right key has been pressed, the Right_Flag 
is set at "1", so that the variable k is incremented by 
"1". When the up key has been pressed, the Up__Flag is 
set at "1", so that the variable j is incremented by 
5 "1". Conversely, when the left key has been pressed, 
the Left_Flag is set at "1", so that the variable k is 
decremented by "1". In the same way, when the down key 
has been pressed, the Down_Flag is set at "1", so that 
the variable j is decremented by "1". 

10 After updating the values of the variables k and j 

in this way, the editing multi-stage control unit 2 6 has 
the cell representation in row j and column k displayed 
in the focus state in step S208, clears all of the flags 
assigned to keys on the remote controller 71 to zero in 

15 step S209, and returns to step S203 where it waits once 
again for a key operation. By repeating the procedure 
in steps S203 to S209 described above, the focus state 
can move up/down and left/right among the cells 
according to key operations made using the remote 

20 controller 71. 

If the user presses the enter key with any of the 
cells in the focus state during the above processing, 
the editing multi-stage control unit 26 proceeds to step 
S251 in Fig. 82. 

25 In step S251 of Fig. 82, the editing multi-stage 
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control unit 2 6 has the user give an indication as to 
whether the cell information in row j and column k 
should be used as it is, or whether only an area within 
the section indicated by this cell information is to be 
5 used. When the cell information is to be used as it is, 
the editing multi-stage control unit 26 copies the cell 
representation in row j and column k to the space given 
as row m and column n in step S252, and defines 
Original__PGC# j .CELL#k as User_Def ined_PGC#m. CELL#n in 
10 step S253. After this defining, in step S254 the 
editing multi-stage control unit 2 6 increments the 
variable n and proceeds to step S209 in Fig. 81. 

When an area within the section indicated by this 
cell information in row j and column k should be used, 
15 the editing multi-stage control unit 26 proceeds to step 
S255 to have the title reproduction control unit 23 
commence partial reproduction for the cell information 
in row j and column k. 

In step S255, the editing multi-stage control unit 
20 2 6 determines the circumstances for the reproduction of 
the cell information in row j and column k. This 
determination is performed since when the section 
indicated by this cell information has been reproduced 
in part, there is no need to reproduce the section once 
25 again from the start, with it being preferable in this 
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case for the reproduction of the section indicated by 
the cell information in row j and column k to commence 
at the position where the previous reproduction was 
terminated (Step S266) , this point being called the 
5 reproduction termination point t. 

On the other hand, when the cell information in row 
j and column k has not been reproduced, the section 
indicated by the cell information in row j and column k 
is reproduced from the start in step S2 65, with the 

10 processing then returning to steps S256 and entering the 
loop formed of steps S256 and S257. Step S256 waits for 
the reproduction of the cell to end, while step S257 
waits for the user to press the mark key. When the 
judgement "Yes" is given in step S257, the processing 

15 advances to step S258, where the time information for 
the pressing of the mark key is obtained, and then to 
step S259. 

In step S259, the editing multi-stage control unit 
2 6 judges whether two sets of time information have been 
20 obtained. If not, the processing returns to step S256, 
or if so, the processing advances to step S2 60 where the 
obtained two sets of time information are set as the 
start point and end point. 

One of the sets of time information obtained here 
25 is the start of the video scene which was marked by the 
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user during its display on the TV monitor 72, while the 
other set of time information is the end of this video 
scene. These sets of time information are interpreted 
as marking a section in the original PGC which is 
5 especially wanted by the user as material for a video 

edit. Accordingly, user-defined PGC information should 
be generated this section, so that cell information is 
generated in the PGC information table work area 21. 
The processing then advances to step S2 61. 
10 In step S261, the user-defined PGC information 

generator 25 obtains the VOB_ID and AV file ID in 
Original_PGC#j .CELL#k. In step S262, the user-defined 
PGC information generator 25 generates 

UserJDef ined__PGC#m. CELL#n using the obtained start point 
15 and end point, VOB_ID, and AV file ID. In step S2 63, 

the end point information is stored as the reproduction 
termination point t and in step S254, the variable n is 
incremented, before the processing returns to step S209. 
As a result of the above processing, new user- 
20 defined cell information is generated from the cell 

information in row j and column k. After this, another 
cell is set into the focus state and another set of 
user-defined cell information is generated from this 
cell, so that a set of user-defined PGC information is 
25 gradually defined one cell at a time. 
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It should be noted here that if the reproduction 
based on the cell information in row j and column k in 
the loop process shown as step S256 to step S257 ends 
without a marking operation having been made, the 
5 processing will return to step S254. 

When it is determined that the end key has been 
pressed, the judgement "Yes" is given in step S206 in 
Fig. 80B and the processing advances to step S213. In 
step S213, a menu is displayed to have the user indicate 
10 whether a next user-defined PGC is to be defined. When 
the user wishes to define a new user-defined PGC and 
gives an indication of such, in step S214 the variable m 
is incremented, the variable n is initialized and the 
processing proceeds to steps S209 and S203. 

15 

(4-2-7-2) Specific Example of the 

Definiti on of User-Defined PGC Information 
The following is a description of the operation 
when defining user-defined PGC information from a 
20 plurality of sets of original PGC information that are 
displayed in the interactive screen image of Fig. 85. 

Figs. 8 6A and 8 6B show the relationship between the 
user operations made via the remote controller 71 and 
the display processing that accompanies the various user 
25 operations. Fig. 87A through Fig. 90 also illustrate 
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examples of such operations, and are referred to in the 
following explanation of these operations. 

As shown in Fig. 85, once the cell #1 which is in 
row 1 and column 1 has been set in the focus state, the 
5 user presses the enter key, as shown in Fig. 8 6B. As a 
result, the judgement "Yes" is given in step S205 and 
the processing proceeds to the flowchart in Fig. 82. In 
steps S251 to S266 of the flowchart in Fig. 82, the 
first cell information CELL #1A in the user-defined PGC 
10 #1 is generated based on the Original__PGC#l . CELL#1 shown 
in Fig. 86A. Once this generation is complete, the 
variable n is incremented in step S254, and the 
processing returns to step S203 via step S209 with the 
value of the variable n at "2". In this example, the 
15 user presses the down key once, as shown in Fig. 87B, 

and the right key twice, as shown in Figs. 87C and 87D. 
In step S204, the flags that corresponds to the keys 
that have been pressed are set at "1". 

As a result of the first press of the down key: 
20 k=l (=1+1*0-1*0) 

j=2 (=1+1*1-1*0) 
As a result of the first press of the right key: 

k=2 (=1+1*1-1*0) 
j=2 (=2+1*0-1*0) 

25 As a result of the second press of the right key: 
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k=3 (=2+1*1-1*0) 
j=2 (2+1*0-1*0) 

As shown in Fig. 87A, the cell #7 located in row 2 
5 and column 3 is set in the focus state. 

Once the cell in row 2 and column 3 has been set in 
the focus state, the user presses the enter key, as 
shown in Fig. 8 8B, so that the judgement "Yes" is given 
in step S205 and the processing advances to the 

10 flowchart in Fig. 82. The cell information #7A, which 
is the second set of cell information in 
UserDef ined_PGC#l, is then generated based on the 
Original_PGC#2 .CELL#7 located in row 2 and column 3 of 
the original PGC information table (see Fig. 88A) . 

15 After the second set of cell information has been 

generated, the above processing is repeated. The user 
presses the enter key as shown in Fig. 89B, so that the 
cell information #11A and the cell information #3A are 
respectively generated as the third and fourth sets of 

20 cell information in UserDef ined_PGC#l . 

The processing returns to step S203 and, in the 
present example, the user then presses the end key. As 
a result, the End_Flag corresponding to the end key is 
set at "1", and the processing advances to step S213. 

25 Since the end key has been pressed, the editing multi- 
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stage control unit 2 6 regards the definition of the 
user-defined PGC information #1 as complete. In step 
S213, the user is asked to indicate whether he/she 
wishes to define another set of user-defined PGC 
5 information (the user-defined PGC information #2) that 
follows this defined user-defined PGC information #1. 
If the user wishes to do so, the variable m is 
incremented, the variable n is initialized, and the 
processing proceeds to step S209. 

10 By repeating the above processing, the user-defined 

PGC information #2 and the user-defined PGC information 
#3 are defined. As shown in Fig. 91, this user-defined 
PGC information #2 is composed of cell #2B, cell #4B, 
cell #10B, and cell #5B, and the user-defined PGC 

15 information #3 is composed of cell #3C, cell #6C, cell 
#8C, and cell #9C. 

Fig. 91 shows the contents of the user-defined PGC 
information table, the original PGC information table 
and the title search pointers at the end of the virtual 

20 edit process. 

If the user presses the end key at this point, the 
interactive screen shown in Fig. 90 will be displayed in 
step S215 in Fig. 81, and the editing multi-stage 
control unit 26 waits for the user to select a set of 

25 user-defined PGC information using the up and down keys. 
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Here, the user can select a preview by pressing the play 
key , and can select a real edit by pressing the real 
edit key, with the user-defined PGC information table 
not being recorded yet. 
5 If the user gives indication for an operation that 

records a user-defined PGC, the user-defined PGC 
information table that includes the new user-defined PGC 
generated in the PGC information table work area 21 is 
transferred to the RTRW management file work area 24, 
10 where it is written into the part of the RTRW management 
file written in the RTRW management file work area 2 4 
that corresponds to the user-defined PGC information 
table . 

At the same time, file system commands are issued 
15 so that a title search pointer for the newly generated 
user-defined PGC information is added to the title 
search pointers that are already present in the RTRW 
management file transferred to the RTRW management file 
work area 24. 

20 Fig. 83 is a flowchart showing the processing 

during a preview or a real edit. The following is a 
description of the processing when performing a preview 
of a VOB linking operation, with reference to this 
flowchart in Fig. 83. 

25 Figs. 92A-92B and 93A-93C show the relationship 
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between operations made using the remote controller 71 
and the display processing that accompanies these 
operations . 

In step S220 of the flowchart of Fig. 83, the first 
5 number in the user-defined PGC information table is 

substituted into the variable j, and step S221, a key 
operation is awaited* When the user makes a key 
operation, in step S222 the flag corresponding the key 
pressed by the user is set at "1". 

10 In step S223, it is judged whether the Play__Flag, 

which shows whether the play key has been pressed, is 
"1", and in step S22 4, it is judged whether the 
RealEdit_Flag, which shows whether the real edit key has 
been pressed, is "1". When both these flags are "0", 

15 the processing proceeds to step S225 where the following 
calculation is performed using the values of the Up_Flag 
and Down_Flag that respectively show whether the up and 
down keys have been pressed. The results of this 
calculation is substituted into the variable j . 

20 

(Down_Flag) -1* (Up_Flag) 



When the user has pressed the up key, the Up_Flag 
will be set at "1", meaning that the variable j is 
25 decremented. Conversely, the user has pressed the down 
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key, the Down__Flag will be set at "1", meaning that the 
variable j is incremented. Once the variable j has been 
updated in this way, in step S226 the image on the 
display corresponding to the PGC information positioned 
5 on row j is set in the focus state. In step S227, all 
of the flags corresponding to keys on the remote 
controller 71 are cleared to zero and the processing 
returns to step S221 where another key operation is 
awaited. This processing in steps S221 to S227 is 

10 repeated, with the focus state moving to a different set 
of PGC information in accordance with user operations of 
the up and down keys on the remote controller 71. 

If the user presses the play key, during the above 
processing is being repeated, with one of the sets of 

15 PGC information in the focus state, the Play_Flag is set 
at "1", the judgement "Yes" is given in step S223, and 
the processing proceeds to S228 . In step S228, the 
editing multi-stage control unit 26 instructs the title 
reproduction control unit 23 to reproduce the VOBs in 

20 accordance with the PGC, out of the user-defined PGCs, 
that has been indicated by the user. 

When the PGC indicated by the user is a user- 
defined PGC, the cells included in the user-defined PGC 
will indicate sections out of the plurality of section 

25 in one or more VOBs in a user-defined order. Since such 
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reproduction will not satisfy the necessary conditions 
for seamless reproduction that were described in the 
first and second embodiments, so that image display and 
output will be stopped at the boundary of a cell during 
5 reproduction before advancing to the next cell. Since 
the necessary conditions for seamless reproduction of 
cells are not satisfied, image display and audio display 
will be interrupted. However, the object of this 
operation is only to give the user a preview of the 
10 linking result for a plurality of scenes, so that this 
object is still achieved regardless of such 
interruptions . 

(4-2-7-3) Processing for a Previe w of a 

15 Multi-Staae Edit and for a Real Edit 

The operation for the linking of VOBs in a real 
edit is described below. 

Figs. 94A to 94C show the relationship between user 
operations of the remote controller 71 and the display 

20 processing that accompany these key operations. The 
user presses the up key as shown in Fig. 94B to have 
cell #1A set into the focus state, and this is reflected 
in the display screen displayed on the TV monitor 72 as 
shown in Fig. 94A. If the user then presses the real 

25 edit key, as shown in Fig. 94C, the judgement "Yes" is 
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made in step S224 in Fig. 83, and the processing from 
step S8 to step S16 in the flowchart of Fig. 43 
described in the third embodiment is performed. 

After completing this processing in the third 
5 embodiment, the processing advances to step S237 in Fig. 
84. After the variable n is set at "1" in step S237, a 
search is performed for the Original_PGC# j . CELL#k which 
was used when generating the UserDef ined_PGC#m. CELL#n in 
step S238 and in step S239 it is judged whether this 

10 Original_PGC# j exists. If so, this Original__PGC# j is 
deleted in step S240, or if not, a search is performed 
for the UserDef ined_PGC#q that was generated from this 
Original_PGC# j in step S240. 

In step S242, it is determined whether there is at 

15 least one such UserDef ined_PGC#q, and if so, all such 
UserDef ined_PGC#q are deleted in step S243. In step 
S244, it is judged whether the value of variable n 
matches the last number of the cell information, and if 
not, the processing advances to step S245 where the 

20 variable n is incremented to indicate the next set of 
cell information in the PGC information #q before the 
processing returns to step S238. The loop process in 
step S238 to step S245 is repeated until the variable n 
reaches the last number of the cell information in the 

25 PGC information #q. 
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The sections indicated by the user-defined PGC 
information #1 are all of VOBs #1, #2, and #3, so that 
these are all subjected to the real edit. The sets of 
original PGC information that were used to generate the 
5 cell information included in user-defined PGC 

information #1 indicate VOBs that are subjected to the 
real edit, so that all of these sets of original PGC 
information are deleted. The sets of user-defined PGC 
that were generated from these sets of PGC information 

10 also indicate VOBs that are subjected to the real edit, 
so that all of these sets of user-defined PGC 
information are also deleted. 

The judgement "Yes" is made in step S244, so that 
the processing advances to step S246, and, out of the 

15 freed PGC numbers obtained by deleting the sets of 

original PGC information, the lowest number is obtained 
as the PGC number #e. Next, in step S247, the cell 
information is updated using the AV file ID assigned to 
the AV file and the VOB_ID after the MERGE command, and 

20 in step S248 the PGC number of the UserDef ined__PGC#q is 
updated to the PGC number #e. In the title search 
pointers, meanwhile, the type information is updated to 
the original type. 

Fig. 95 shows examples of the PGC information table 

25 and the title search pointers after the deletion of sets 
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of original PGC information and user-defined PGC 
information that accompanies a real edit. 

Since the VOBs #1, #2, and #3 indicated by the 
sections in user-defined PGC information #1 are 
5 subjected to the real edit, the original PGC information 
#1, the original PGC information #2, the original PGC 
information #3, the user-defined PGC information #2, and 
the user-defined PGC information #3 will already have 
been deleted. Conversely, what was formerly the user- 

10 defined PGC information #1 has been defined as the 
original PGC information #1. 

Once the PGC information has been updated in the 
PGC information table work area 21 as described above, 
the new original PGC information is transferred to the 

15 RTRW management file work area 2 4 where it is used to 
overwrite the RTRW management file presently stored in 
the RTRW management file work area 24. At the same 
time, the title search pointer for this newly generated 
original PGC information is transferred to the RTRW 

20 management file work area 24 where it is used to 

overwrite the title search pointers already present in 
the RTRW management file. 

Once the user-defined PGC information table and 
title search pointers have been written, file system 

25 commands are issued so that the RTRW management file 
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stored in the RTRW management file work area 2 4 is 
written into the RTRW directory. 

With this present embodiment, the sections to be 
used as materials for a real edit are indicated by user- 
5 defined cell information, with these being freely 

arranged to provisionally decide the reproduction route. 

When the user wishes to set a reproduction route of 
the editing materials, this can be achieved without 
having to temporarily produce a VOB, so that the editing 

10 of video materials can be performed in a short time 

using a simple method. This also means that there is no 
need to use more of the storage capacity of the DVD-RAM 
to store a temporarily produced VOB. 

If the provisional determination of scene linking 

15 can be achieved by merely defining a set of user-defined 
PGC information, the user can produce many variations of 
the reproduction route in a short time. The sets of 
user-defined cell information are indicated using time 
information for sections in VOBs, so that the indicated 

20 VOBs can be maintained in the state in which they were 
already recorded. 

The user can generate a plurality of sets of user- 
defined PGC information for different reproduction 
routes and then view previews of these routes to find 

25 the most suitable of these reproduction routes. The 
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user can then indicate a real edit for his/her preferred 
reproduction route, and so process the VOBs in 
accordance with the selected user-defined PGC 
information. This means that the user can perform a 
5 bold editing process that directly rewrites the VOBs 

that are already stored on an optical disc. While the 
original VOBs will be effectively deleted from the disc, 
the user is able to verify the result of this before 
giving the real edit indication, making this not a 

10 particular problem for the present invention. 

Once a real edit has been performed, the title type 
in the title search pointer of the user-defined PGC 
information used for the real edit will be set to 
"original type PGC information", so that this can be 

15 used as the base for following video editing operations. 

As described above, a single video data editing 
apparatus that uses only one optical disc can perform 
advanced video editing whereby a user can select one out 
of a plurality of freely chosen potential arrangements 

20 of the source material. As a result, by using the 

present video data editing apparatus, a large number of 
video enthusiasts will be able to perform advanced 
editing operations that were considered out of the reach 
of conventional domestic video equipment. 

25 It should be noted here that the time information 
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may be taken from the mark points in the cell 
information and managed with information such as address 
taken from the time map table in the form of a table. 
By doing so, this information can be presented to the 
5 user as potential selections in a screen showing the 
pre-editing state . 

Reduced images (known as "thumbnails") may also be 
generated for each mark point and stored as separate 
files, with pointer information also being produced for 
10 each thumbnail. When displaying the cell information at 
the pre-editing stage, these thumbnails may be displayed 
to show the potential selections that can be made by the 
user . 

The processing of components such as the title 
15 reproduction control unit 23 (see Fig. 78) and the 

processing of the editing multi-stage control unit 26 
(Figs. 81 to 84) that was described in this fourth 
embodiment using flowcharts can be achieved by a machine 
language program. Such machine language program may be 
20 distributed and sold having been recorded on a recording 
medium. Examples of such recording medium are an IC 
card, an optical disc, or a floppy disc. The machine 
language program recorded on the recording medium may 
then be installed into a standard personal computer. By 
25 executing the installed machine language programs, the 
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standard personal computer can achieve the functions of 
the video data editing apparatus of this fourth 
embodiment . 

As a final note regarding the relationship between 
5 VOBs and original PGC information, it is preferable for 
one set of original PGC information to be provided for 
each VOB. 

Although the present invention has been fully 
described by way of examples with reference to 
10 accompanying drawings, it is to be noted that various 
changes and modifications will be apparent to those 
skilled in the art. Therefore, unless such changes and 
modifications depart from the scope of the present 
invention, they should be construed as being included 
therein . 

Industrial Field of Application 

The video data editing apparatus, optical disc, and 
recording medium storing an editing program of the present 
invention enable the editing of video images that are stored 
on an optical disc to be performed easily and in a short 
time. This is highly suitable for domestic video equipment, 
and creates a new market for domestic video editing 
apparatuses . 
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1 1. An optical disc, comprising: 

2 a data area for recording a file including at least one 

3 video object; and 

4 an index area for recording original type chain 

5 information and at least one set of user-defined type chain 

6 information, 

7 the original type chain information managing the file 

8 as an arrangement of a plurality of file sections, each file 

9 section being indicated by section boundaries that are a 

10 combination of any two of - 

11 (a) a start position of a video object, 

12 (b) an end position of a video object, and 

13 (c) at least one predetermined position within a video 

14 object, 

15 each set of user-defined type chain information 

16 indicating a plurality of file parts in the file and a 

17 reproduction route for the indicated file parts, 

18 each file part being indicating by part boundaries, the 

19 part boundaries being any of - 

20 (a) two of the section boundaries 

21 (b) one of the section boundaries and a position in a 

22 video object that differs from the predetermined positions, 

23 (c) two positions in a video object that differ from 

24 the predetermined positions, 

25 each reproduction route being independent of an order 



270 



WO 99/14935 



PCT/JP98/04174 



26 in which the indicated file parts appear in the file. 

1 2. The optical disc of Claim 1, 

2 wherein one out of the plurality of sets of user- 

3 defined type chain information is selected as an editing 

4 pattern, and 

5 the file recorded in the data area is processed so that 

6 the part boundaries indicated by the set of user-defined 

7 type chain information selected as the editing pattern 

8 become start positions and end positions for video objects 

9 in the file. 

1 3. The optical disc of Claim 2, 

2 wherein when one of the sets of user-defined type chain 

3 information is selected as the editing pattern, the original 

4 type chain information and sets of user-defined type chain 

5 information aside from the selected set of user-defined type 

6 chain information are all deleted from the 

7 index area, and the indicated set of user-defined type chain 

8 information is converted into original type chain 

9 information. 

1 4. The optical disc of Claim 3, 

2 wherein the original type chain information and the 

3 sets of user-defined type chain information are each a cell 

4 information sequence including a plurality of sets of cell 
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5 information, 

6 wherein the index area includes a search pointer area, 

7 the search pointer area including a set of identification 

8 information for each cell information sequence and a flag 

9 associated to each set of identification information, each 

10 flag being set at one of a first value to show that the 

11 corresponding cell information sequence is the original type 

12 chain information and a second value to show that the 

13 corresponding cell information sequence is a set of user- 

14 defined type chain information, and 

15 wherein when a user-defined type chain information is 

16 selected as the editing pattern, a setting of the flag 

17 corresponding to the selected set of user-defined type chain 

18 information changes from the second value to the first 

19 value. 



1 5. The optical disc of Claim 4, 

2 wherein each video object includes a plurality of video 

3 object units, each video object unit including a plurality 

4 of sets of picture data that are reproduced for a plurality 

5 of video frames for a certain reproduction period, 

6 wherein each set of cell information includes: 

7 time information for indicating a presentation start 

8 time and presentation end time for video data in a cell, the 

9 cell being one of a file section and a file part; and 

10 identification information for indicating the video 
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11 object to which the cell belongs, 

12 and wherein the optical disc further comprises a 

13 mapping region for recording mapping information that 

14 specifies the recording positions of video object units that 

15 include picture data that is to be reproduced for the time 

16 information in a set of cell information. 



1 6. A video data editing apparatus that uses an optical disc 

2 as an editing medium, 

3 the optical disc including: 

4 a data area for recording a file including at least one 

5 video object; and 

6 an index area for recording original type chain 

7 information and at least one set of user-defined type chain 

8 information, 

9 the original type chain information managing the file 

10 as an arrangement of a plurality of file sections, each file 

11 section being indicated by section boundaries that are a 

12 combination of any two of - 

13 (a) a start position of a video object, 

14 (b) an end position of a video object, and 

15 (c) at least one predetermined position within a video 

16 object, 

17 each set of user-defined type chain information 

18 indicating a plurality of file parts in the file and a 

19 reproduction route for the indicated file parts, 
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20 each file part being indicating by part boundaries, the 

21 part boundaries being any of - 

22 (a) two of the section boundaries 

23 (b) one of the section boundaries and a position in a 

24 video object that differs from the predetermined positions, 

25 (c) two positions in a video object that differ from 
2 6 the predetermined positions, 

27 each reproduction route being independent of an order 

28 in which the indicated file parts appear in the file, 

29 the video data editing apparatus comprising: 

30 reception means for receiving an operation from a user; 

31 processing means for processing the file, when the 

32 reception means has received an indication of a real edit 

33 operation for one of the sets of user-defined type chain 

34 information recorded on the optical disc, so that starting 

35 and ending part boundaries indicated in the indicated set of 

36 user-defined type chain information become boundaries of the 

37 video objects recorded on the optical disc; and 

38 updating means for updating, after processing by the 

39 processing means, the indicated set of user-defined type 

40 chain information in the index area to convert the indicated 

41 set of user-defined type chain information into the original 

42 type chain information. 

1 7. The video data editing apparatus of Claim 6, 

2 wherein when a real edit has been indicated for one of 
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3 the sets of user-defined type chain information, the 

4 updating means deletes the original type chain information 

5 and other sets of user-defined type chain information from 

6 the index area, and converts the indicated set of user- 

7 defined type chain information into the original type chain 

8 information. 

1 8, The video data editing apparatus of Claim 7, 

2 wherein the original type chain information and the 

3 sets of user-defined type chain information are each a cell 

4 information sequence including a plurality of sets of cell 

5 information, and the index area includes a search pointer 

6 area, the search pointer area including a set of 

7 identification information for each cell information 

8 sequence and a flag associated to each set of identification 

9 information, each flag being set at one of a first value to 

10 show that the corresponding cell information sequence is the 

11 original type chain information and a second value to show 

12 that the corresponding cell information sequence is a set of 

13 user-defined type chain information, and 

14 wherein when a real edit has been indicated for one of 

15 the sets of user-defined type chain information, the 

16 updating means updates a flag corresponding to the indicated 

17 set of user-defined type chain information from the second 

18 value to the first value. 
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1 9. The video data editing apparatus of Claim 8, 

2 wherein when the reception means has received an 

3 indication for a real edit for one of the sets of user- 

4 defined type chain information recorded on the optical disc, 

5 the processing means: 

6 reads a predetermined number of video object units 

7 including a start of a part and an end of a part; 

8 re-encodes the read video object units so that the read 

9 video object units become a start and an end of video 

10 objects which are reproduced seamlessly; and 

11 overwrites the data area using the re-encoded video 

12 object units. 

1 10. The video data editing apparatus of Claim 6, 

2 wherein each video object includes a plurality of video 

3 object units, each video object unit including a plurality 

4 of sets of picture data that are reproduced for a plurality 

5 of video frames for a certain reproduction period, 

6 wherein each set of cell information includes: 

7 time information including a presentation start time 

8 information and presentation end time information for video 

9 data in a cell, the presentation start time information 

10 showing a display video field of a set of picture data that 

11 should be reproduced first in a part of a video object and 

12 the presentation end time information showing a display 

13 video field of a set of picture data that should be 
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14 reproduced last in the part, and the cell being one of a 

15 file section and a file part; and 

16 identification information for indicating the video 

17 object to which the cell belongs, 

18 wherein the reception means receives a play indication 

19 for one of the sets of user-defined type chain information, 

20 and wherein the video data editing apparatus further 

21 comprises: 

22 access position specifying means 

23 for reading, when the reception means has received a 

24 play indication, mapping information and a pair of 

25 presentation start time information and presentation end 
2 6 time information from the index area, and 

27 for specifying, by searching the mapping information 

28 using the presentation start time information and the 

29 presentation end time information, a recording position of a 

30 start video object unit including picture data that is 

31 displayed for the presentation start time information and a 

32 recording position of an end video object unit including 

33 picture data that is displayed for the presentation end time 

34 information; 

35 reading means for reading a video object unit sequence 

36 recorded between the specified recording positions; and 

37 decoding means 

38 for decoding the read video object unit sequence and, 

39 for outputting, when part boundaries of a part 
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40 corresponding to a set of cell information in the indicated 

41 set of user-defined type chain information do not match a 

42 start position of the start video object unit and an end 

43 position of the end video object unit sequence, a decoding 

44 result between a first video field and an end video field 

45 indicated by the cell information, and for prohibiting 

4 6 output of a decoding result of data before the first video 

47 field and data after the end video field. 

1 11. An optical disc, comprising: 

2 a data area recording at least one video object, each 

3 video object including a plurality of video object units, 

4 each video object unit including a plurality of sets of 

5 picture data that are reproduced for a certain reproduction 

6 period; and 

7 an index area recording sets of cell information and 

8 mapping information, 

9 each set of cell information including presentation 

10 start time information and presentation end time 

11 information, the presentation start time information showing 

12 a display video field of a set of picture data that should 

13 be reproduced first in a part of a video object and the 

14 presentation end time information showing a display video 

15 field of a set of picture data that should be reproduced 

16 last in the part, and 

17 the mapping information enabling a recording position 
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18 of a video object unit to be specified based on time 

19 information, 

1 12. A reproduction apparatus for an optical disc, 

2 the optical disc comprising: a data area recording at 

3 least one video object, each video object including a 

4 plurality of video object units, each video object unit 

5 including a plurality of sets of picture data that are 

6 reproduced for a certain reproduction period; and an index 

7 area recording sets of cell information and mapping 

8 information, each set of cell information including 

9 presentation start time information and presentation end 

10 time information, the presentation start time information 

11 showing a display video field of a set of picture data that 

12 should be reproduced first in a part of a video object and 

13 the presentation end time information showing a display 

14 video field of a set of picture data that should be 

15 reproduced last in the part, and the mapping information 

16 enabling a recording position of a video object unit to be 

17 specified based on time information, 

18 the reproduction apparatus comprising: 

19 reception means for receiving an operation from a user; 

20 access position specifying means 

21 for reading, when the reception means has received a 

22 play indication, mapping information and a pair of 

23 presentation start time information and presentation end 
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24 time information from the index area, and 

25 for specifying, by searching the mapping information 

26 using the presentation start time information and the 

27 presentation end time information, a recording position of a 

28 start video object unit including picture data that is 

2 9 displayed for the presentation start time information and a 

30 recording position of an end video object unit including 

31 picture data that is displayed for the presentation end time 

32 information; 

33 reading means for reading a video object unit sequence 

34 recorded between the specified recording positions; and 

35 decoding means 

3 6 for decoding the read video object unit sequence and, 

37 for outputting, when part boundaries of a part 

38 corresponding to a set of cell information in the indicated 

39 set of user-defined type chain information do not match a 

40 start position of the start video object unit and an end 

41 position of the end video object unit sequence, a decoding 

42 result between a first video field and an end video field 

43 indicated by the cell information, and for prohibiting 

44 output of a decoding result of data before the first video 

45 field and data after the end video field. 

1 13. A computer-readable storage medium storing a video data 

2 editing program that uses an optical disc as an editing 

3 medium, 



280 



WO 99/14935 



PCT/JP98/04174 



4 the optical disc comprising: 

5 a data area for recording a file including at least one 

6 video object; and 

7 an index area for recording original type chain 

8 information and at least one set of user-defined type chain 

9 information, 

10 the original type chain information managing the file 

11 as an arrangement of a plurality of file sections, each file 

12 section being indicated by section boundaries that are a 

13 combination of any two of - 

14 (a) a start position of a video object, 

15 (b) an end position of a video object, and 

16 (c) at least one predetermined position within a video 

17 object, 

18 each set of user-defined type chain information 

19 indicating a plurality of file parts in the file and a 

20 reproduction route for the indicated file parts, 

21 each file part being indicating by part boundaries, the 

22 part boundaries being any of - 

23 (a) two of the section boundaries 

24 (b) one of the section boundaries and a position in a 

25 video object that differs from the predetermined positions, 

26 (c) two positions in a video object that differ from 

27 the predetermined positions, 

28 each reproduction route being independent of an order 

29 in which the indicated file parts appear in the file, 
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30 the editing program comprising the following steps: 

31 a reception step for receiving an operation from a 

32 user; 

33 a processing step for processing the file, when the 

34 reception step has received an indication of a real edit 

35 operation for one of the sets of user-defined type chain 

36 information recorded on the optical disc, so that starting 

37 and ending part boundaries indicated in the indicated set of 

38 user-defined type chain information become boundaries of the 

39 video objects recorded on the optical disc; and 

40 an updating step for updating, after processing by the 

41 processing step, the indicated set of user-defined type 

42 chain information in the index area to convert the indicated 

43 set of user-defined type chain information into the original 

44 type chain information. 



1 14. A computer-readable storage medium storing a 

2 reproduction program for an optical disc, 

3 the optical disc comprising: a data area recording at 

4 least one video object, each video object including a 

5 plurality of video object units, each video object unit 

6 including a plurality of sets of picture data that are 

7 reproduced for a certain reproduction period; and an index 

8 area recording sets of cell information and mapping 

9 information, each set of cell information including 

10 presentation start time information and presentation end 
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11 time information, the presentation start time information 

12 showing a display video field of a set of picture data that 

13 should be reproduced first in a part of a video object and 

14 the presentation end time information showing a display 

15 video field of a set of picture data that should be 

16 reproduced last in the part, and the mapping information 

17 enabling a recording position of a video object unit to be 

18 specified based on time information, 

19 the reproduction program comprising the following 

20 steps: 

21 a reception step for receiving an operation from a 

22 user; 

23 an access position specifying step 

24 for reading, when the reception step has received a 

25 play indication, mapping information and a pair of 

26 presentation start time information and presentation end 

27 time information from the index area, and 

28 for specifying, by searching the mapping information 

29 using the presentation start time information and the 

30 presentation end time information, for specifying a 

31 recording position of a start video object unit including 

32 picture data that is displayed for the presentation start 

33 time information and a recording position of an end video 

34 object unit including picture data that is displayed for the 

35 presentation end time information; 

36 a reading step for reading a video object unit sequence 
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37 recorded between the specified recording positions; and 

38 a decoding step 

39 for decoding the read video object unit sequence and, 

40 for outputting, when part boundaries of a part 

41 corresponding to a set of cell information in the indicated 

42 set of user-defined type chain information do not match a 

43 start position of the start video object unit and an end 

44 position of the end video object unit sequence, a decoding 

45 result between a first video field and an end video field 
4 6 indicated by the cell information, and for prohibiting 

47 output of a decoding result of data before the first video 

48 field and data after the end video field. 



1 16. An optical disc, comprising, 

2 a data area for recording a plurality of video 

3 segments; and 

4 a video area for recording original type chain 

5 information and user-defined type chain information, 

6 wherein the original type chain information defines an 

7 original type chain including first ones of the video 

8 segments by describing a reproduction order of the first 

9 video segments, and the user-defined type chain information 

10 defines a user-defined type chain including second ones of 

11 the video segments by describing a reproduction order of the 

12 second video segments, 

13 wherein the first video segments are positioned in the 
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14 data area in the reproduction order of the first video 

15 segments whereas the second video segments are positioned in 

16 the data area independently of the reproduction order of the 

17 second video segments. 
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